Data

Centrelink Files

La Trobe University
Alan Healey (Aggregated by)
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.26181/5ec7502948caa&rft.title=Centrelink Files&rft.identifier=10.26181/5ec7502948caa&rft.publisher=La Trobe University&rft.description=Raw data scraped from the Centrelink website on March 11, 2019. Zip file contains 3 folders: Folders, Pages, and Text. Pages contains each individual webpage, with HTML code.The Folders and Text folders both contain processed text files that have been classified by section (Folders) or not (Text). Individual files outside of these folders were found to cause problems during processing (such as using non-ASCII characters.&rft.creator=Alan Healey&rft.date=2020&rft_rights= https://creativecommons.org/licenses/by-sa/4.0/&rft_rights=The Author reserves all moral rights over the deposited text and must be credited if any re-use occurs. Documents deposited in OPAL are the Open Access versions of outputs published elsewhere. Changes resulting from the publishing process may therefore not be reflected in this document. The final published version may be obtained via the publisher’s DOI. Please note that additional copyright and access restrictions may apply to the published version.&rft_subject=Linguistics not elsewhere classified&rft_subject=Centrelink&rft_subject=Linguistics&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

Other view details
Cc By-sa 4.0

https://creativecommons.org/licenses/by-sa/4.0/

The Author reserves all moral rights over the deposited text and must be credited if any re-use occurs. Documents deposited in OPAL are the Open Access versions of outputs published elsewhere. Changes resulting from the publishing process may therefore not be reflected in this document. The final published version may be obtained via the publisher’s DOI. Please note that additional copyright and access restrictions may apply to the published version.

Access:

Other

Full description

Raw data scraped from the Centrelink website on March 11, 2019.

Zip file contains 3 folders: Folders, Pages, and Text.

Pages contains each individual webpage, with HTML code.
The Folders and Text folders both contain processed text files that have been classified by section (Folders) or not (Text).

Individual files outside of these folders were found to cause problems during processing (such as using non-ASCII characters.

Issued: 22 05 2020

This dataset is part of a larger collection

Click to explore relationships graph
Subjects

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Identifiers