Synthetic Priority Investment Approach Data, 2001-2015

Version 1.1

Department of Social Services; Data61, 2017, "Synthetic Priority Investment Approach Data, 2001-2015", https://doi.org/10.4225/87/FASD1J, ADA Dataverse, V1

Learn about Data Citation Standards.

Contact Owner

Dataset Metrics

4,222 Downloads

Description	The Australian Priority Investment Approach to Welfare (PIA) policy initiative was established as part of the 2015-16 Budget, following a comprehensive review of Australia’s welfare system. The initiative uses data analysis to identify groups at risk of long-term welfare dependence. These analyses provide insights into how the system is working and uses those insights to find innovative ways of helping more Australians live independently of welfare. As part of the PIA, in September 2016, the Minister for Social Services announced a plan to allow limited public access to PIA data. A synthetic version of the PIA data has been created for use by researchers and teachers. The synthetic data relates to individuals who have made a claim for, are receiving or have received payments or services administered under social security law and family assistance law. This includes benefit types such as Aged Pension, Youth Allowance, Newstart and Disability Support Pension. The synthetic data contains a limited number of variables suitable for research, while maintaining the privacy and confidentiality of individuals. The synthetic dataset has been created by applying a privacy-preserving algorithm on the original PIA data. This process results in each person’s true data being modified such that the overall group data very closely represents that of the original dataset, yet no one individual’s data can be identified in the synthetic dataset. That is, each line of data that would normally represent an individual no longer does. The dataset is a combination of synthetic records that, when combined, reflect the shape of the original dataset. The synthetic PIA data contains a series of point-in-time quarterly snapshots dated from July 2001 to June 2015. This results in 56 separate quarters of administrative data. Each quarter includes 31 variables (available in the ‘PIA Data Dictionary – Variable and Codes’ file) that are consistent across all quarters. There are approximately 5 million individual records in each quarter.
Subject	Social Sciences
Keyword	Family allowances, Public administration, Social policy, Social welfare
Notes	The Synthetic PIA Data files are loaded as 56 separate zipped .csv files to reduce the user’s required resources for download. Please note, software programs like Excel are not recommended for use with the Synthetic PIA Data files due to the limitation of 1,048,576 rows. That is, the full dataset with approximately 5 million records will not load in Excel.
License/Data Use Agreement	Custom Dataset Terms

Filter by

	1 to 10 of 58 Files	Download
	PIA Data Dictionary - Variable and Codes.XLSX.zip ZIP Archive - 60.4 KB Published Dec 18, 2017 322 Downloads MD5: d40e728087a06dd6dc045f51e12c7e3b PIA Data Dictionary - Variable and Codes for Synthetic Data of the Priority Investment Approach (PIA) dataset (*.XLSX format) DocumentationSyntheticPIA	Access File File Access Public Download Options ZIP Archive Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	Synthetic_Data_User_Guide.pdf Adobe PDF - 469.0 KB Published Dec 18, 2017 346 Downloads MD5: 0a1bdc01444c1fe6ef1fd09072d8b317 User guide for the PIA Synthetic Dataset DocumentationSyntheticPIA	Access File File Access Public Download Options Adobe PDF Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	synthetic_pia_csv_qtr_2001-07-01.csv.zip ZIP Archive - 81.8 MB Published Dec 18, 2017 128 Downloads MD5: f3826852645667124281074c1f285dd9 Synthetic PIA data file - 2001 Q3 - CSV format DataSyntheticPIA	Access File File Access Restricted Users may not request access to files. Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	synthetic_pia_csv_qtr_2001-10-01.csv.zip ZIP Archive - 82.0 MB Published Dec 18, 2017 92 Downloads MD5: de98cb09214a381a09b11d15e8273a29 Synthetic PIA data file - 2001 Q4 - CSV format DataSyntheticPIA	Access File File Access Restricted Users may not request access to files. Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	synthetic_pia_csv_qtr_2002-01-01.csv.zip ZIP Archive - 83.2 MB Published Dec 18, 2017 86 Downloads MD5: 2bffbb309fe32386adbec84be4c05a49 Synthetic PIA data file - 2002 Q1 - CSV format DataSyntheticPIA	Access File File Access Restricted Users may not request access to files. Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	synthetic_pia_csv_qtr_2002-04-01.csv.zip ZIP Archive - 82.0 MB Published Dec 18, 2017 83 Downloads MD5: e4ddab812feebf6405f3355b03a6fd9c Synthetic PIA data file - 2002 Q2 - CSV format DataSyntheticPIA	Access File File Access Restricted Users may not request access to files. Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	synthetic_pia_csv_qtr_2002-07-01.csv.zip ZIP Archive - 81.6 MB Published Dec 18, 2017 82 Downloads MD5: 60187b1cee17e22717f3ed0106bcbbeb Synthetic PIA data file - 2002 Q3 - CSV format DataSyntheticPIA	Access File File Access Restricted Users may not request access to files. Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	synthetic_pia_csv_qtr_2002-10-01.csv.zip ZIP Archive - 82.6 MB Published Dec 18, 2017 85 Downloads MD5: b71ca3dbf6878937ee647dbeeb795fc1 Synthetic PIA data file - 2002 Q4 - CSV format DataSyntheticPIA	Access File File Access Restricted Users may not request access to files. Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	synthetic_pia_csv_qtr_2003-01-01.csv.zip ZIP Archive - 83.6 MB Published Dec 18, 2017 81 Downloads MD5: c8567e4e08f0847a55f36210dd251eb5 Synthetic PIA data file - 2003 Q1 - CSV format DataSyntheticPIA	Access File File Access Restricted Users may not request access to files. Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX
	synthetic_pia_csv_qtr_2003-04-01.csv.zip ZIP Archive - 83.8 MB Published Dec 18, 2017 79 Downloads MD5: da82a2282eca1d74cb6c4a0bf52bdfa2 Synthetic PIA data file - 2003 Q2 - CSV format DataSyntheticPIA	Access File File Access Restricted Users may not request access to files. Download Metadata Data File Citation Download EndNote XML Download RIS Download BibTeX

Citation Metadata

Persistent Identifier	doi:10.4225/87/FASD1J
Publication Date	2017-12-18
Title	Synthetic Priority Investment Approach Data, 2001-2015
Other Identifier	ADA: 01444
Author	Department of Social Services (Australian Government) Data61 (Australian Government)
Point of Contact	Use email button above to contact. Australian Data Archive (Australian National University)
Description	The Australian Priority Investment Approach to Welfare (PIA) policy initiative was established as part of the 2015-16 Budget, following a comprehensive review of Australia’s welfare system. The initiative uses data analysis to identify groups at risk of long-term welfare dependence. These analyses provide insights into how the system is working and uses those insights to find innovative ways of helping more Australians live independently of welfare. As part of the PIA, in September 2016, the Minister for Social Services announced a plan to allow limited public access to PIA data. A synthetic version of the PIA data has been created for use by researchers and teachers. The synthetic data relates to individuals who have made a claim for, are receiving or have received payments or services administered under social security law and family assistance law. This includes benefit types such as Aged Pension, Youth Allowance, Newstart and Disability Support Pension. The synthetic data contains a limited number of variables suitable for research, while maintaining the privacy and confidentiality of individuals. The synthetic dataset has been created by applying a privacy-preserving algorithm on the original PIA data. This process results in each person’s true data being modified such that the overall group data very closely represents that of the original dataset, yet no one individual’s data can be identified in the synthetic dataset. That is, each line of data that would normally represent an individual no longer does. The dataset is a combination of synthetic records that, when combined, reflect the shape of the original dataset. The synthetic PIA data contains a series of point-in-time quarterly snapshots dated from July 2001 to June 2015. This results in 56 separate quarters of administrative data. Each quarter includes 31 variables (available in the ‘PIA Data Dictionary – Variable and Codes’ file) that are consistent across all quarters. There are approximately 5 million individual records in each quarter.
Subject	Social Sciences
Keyword	Family allowances (APAIS) http://www.vocabularyserver.com/apais/index.php?tema=829 Public administration (APAIS) http://www.vocabularyserver.com/apais/index.php?tema=1749 Social policy (APAIS) http://www.vocabularyserver.com/apais/index.php?tema=2011 Social welfare (APAIS) http://www.vocabularyserver.com/apais/index.php?tema=2022
Notes	The Synthetic PIA Data files are loaded as 56 separate zipped .csv files to reduce the user’s required resources for download. Please note, software programs like Excel are not recommended for use with the Synthetic PIA Data files due to the limitation of 1,048,576 rows. That is, the full dataset with approximately 5 million records will not load in Excel.
Language	English
Producer	Department of Social Services (Australian Government) (DSS) https://www.dss.gov.au/
Distributor	Australian Data Archive (Australian National University) (ADA) http://ada.edu.au
Distribution Date	2017-12-18
Depositor	Neuendorf, Annette
Deposit Date	2017-11-24
Time Period	Start Date: 2001-07-01; End Date: 2015-06-30
Date of Collection	Start Date: 2001-07-01; End Date: 2015-06-30
Data Type	Synthetic data
Related Dataset	Priority Investment Approach Dataset: Unit record level data of the original PIA data is available to eligible users via a secure, password-protected online remote access research gateway. This access point enables users to analyse linked datasets remotely via Secure Unified Research Environment (SURE) through the Australian Institute of Health and Welfare (AIHW). Information regarding this access is available on the AIHW website. Application packs can be requested from pia.dataset@aihw.gov.au; TableBuilder: A subset of the PIA data is hosted on the Australian Bureau of Statistics’ (ABS) TableBuilder data analytics tool. TableBuilder allows public users to create tables that display aggregate (or group level) data from a subset of the PIA dataset without ever seeing unit record data. Information regarding this access is available on the ABS website. Registration is available at http://abs.gov.au/registration.

Geospatial Metadata

Geographic Coverage	Australia
Geographic Unit	Country

Social Science and Humanities Metadata

Unit of Analysis	Individual
Universe	The Synthetic Data relates to individuals who have made a claim for, are receiving or have received payments or services administered under social security law and family assistance law. This includes benefit types such as Aged Pension, Youth Allowance, Newstart and Disability Support Pension.
Time Method	Time series
Frequency	Quarterly
Sampling Procedure	No sampling - total population
Collection Mode	Compilation or synthesis of existing material
Weighting	None
Response Rate	Not applicable
Notes	Note Funding The generation of these synthetic datasets was funded through the National Innovation Science Agenda (NISA) - Data61 Platforms for Open Data Program.

Dataset Terms

License/Data Use Agreement

Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.

Custom Dataset Terms - the following Custom Dataset Terms have been defined for this dataset.

1. I acknowledge that I will not distribute, share or disseminate any of the files available from this page.

2. I also acknowledge that synthetic data is not suitable for the purpose of preparing population estimates.

3. I agree that if requested by Department of Social Services or Australian Data Archive, I will delete all versions of the synthetic data that I have access to.

4. I acknowledge that these terms and conditions are in addition to the Australian Data Archive General Terms and Conditions of Use I have previously agreed to on registration with the ADA Dataverse.

By submitting an access request, I am agreeing to these Terms and Conditions of Use

Restricted Files + Terms of Access

Restricted Files

There are 56 restricted files in this dataset.

Terms of Access for Restricted Files

To request download access for restricted files in this data set, please select the file(s) and click the "Request Access" button.

By clicking ‘Request Access’ you are agreeing to the Terms & Conditions of Use.

Request Access

Users may not request access to files.

Guestbook

The following guestbook will prompt a user to provide additional information when downloading a file.

PIA_Guestbook_2017

Dataset Version	Summary	Contributors	Published on
No records found.

Edit File

This file has already been deleted (or replaced) in the current version. It may not be edited.

Restrict Access

Restricting limits access to published files. People who want to use the restricted files can request access by default. If you disable request access, you must add information about access to the Terms of Access field.

Learn about restricting files and dataset access in the User Guide.

Request Access

Enable access request

You must enable request access or add terms of access to restrict file access.

Terms of Access for Restricted Files

Save Changes

Edit Embargo

The selected file or files have already been published. Contact an administrator to change the embargo date or reason of the file or files.

Edit Retention Period

The selected file or files have already been published. Contact an administrator to change the retention period date or reason of the file or files.

Delete Files

The file will be deleted after you click on the Delete button.

Files will not be removed from previously published versions of the dataset.

Select File(s)

Please select one or more files.

Share Dataset

Share this dataset on your favorite social media networks.

Continue

Dataset Citations

Citations for this dataset are retrieved from Crossref via DataCite using Make Data Count standards. For more information about dataset metrics, please refer to the User Guide.

Sorry, no citations were found.

Inaccessible Files Selected

The selected file(s) may not be downloaded because you have not been granted access or the file(s) have a retention period that has expired or the files can only be transferred via Globus.

Ineligible Files Selected

The selected file(s) may not be transferred because you have not been granted access or the file(s) have a retention period that has expired or the files are not Globus accessible.

Download Options

The files selected are too large to download as a ZIP.

You can select individual files that are below the 100.0 MB download limit from the files table, or use the Data Access API for programmatic access to the files.

Select File(s)

Please select a file or files to be downloaded.

Inaccessible Files Selected

The selected file(s) may not be downloaded because you have not been granted access or the file(s) have a retention period that has expired.

Click Continue to download the files you have access to download.

Ineligible Files Selected

Some file(s) cannot be transferred. (They are restricted, embargoed, with an expired retention period, or not Globus accessible.)

Click Continue to transfer the elligible files.

Delete Dataset

Are you sure you want to delete this dataset and all of its files? You cannot undelete this dataset.

Delete Draft Version

Are you sure you want to delete this draft version? Files will be reverted to the most recently published version. You cannot undelete this draft.

Unpublished Dataset Preview URL

Preview URL can only be used with unpublished versions of datasets.

Unpublished Dataset Preview URL

Are you sure you want to disable the Preview URL? If you have shared the Preview URL with others they will no longer be able to use it to access your unpublished dataset.

Delete Files

The file(s) will be deleted after you click on the Delete button.

Files will not be removed from previously published versions of the dataset.

Compute

This dataset contains restricted files you may not compute on because you have not been granted access.

Deaccession Dataset

Are you sure you want to deaccession? This is permanent and the selected version(s) will no longer be viewable by the public.

Deaccession Dataset

Are you sure you want to deaccession this dataset? This is permanent an it will no longer be viewable by the public.

Version Differences Details

Please select two versions to view the differences.

Version Differences Details

Version:
Last Updated:

Select File(s)

Please select a file or files for access request.

Select File(s)

Embargoed files cannot be accessed. Please select an unembargoed file or files for your access request.

Edit Tags

Select existing file tags or create new tags to describe your files. Each file can have more than one tag.

Request Access

You need to Sign Up or Log In to request access.

Dataset Terms

Please confirm and/or complete the information needed below in order to request access to files in this dataset.

This dataset is made available under the following terms. Please confirm and/or complete the information needed below in order to continue.

License/Data Use Agreement

Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.

Custom terms specific to this dataset Custom Dataset Terms - the following Custom Dataset Terms have been defined for this dataset.

1. I acknowledge that I will not distribute, share or disseminate any of the files available from this page.

2. I also acknowledge that synthetic data is not suitable for the purpose of preparing population estimates.

3. I agree that if requested by Department of Social Services or Australian Data Archive, I will delete all versions of the synthetic data that I have access to.

By submitting an access request, I am agreeing to these Terms and Conditions of Use

Name

Institution

Position

Additional Questions

Do you live in Australia?

If you live in Australia, please provide your postcode.

If you live outside Australia, please indicate which country you live in.

What is your primary intended use of this data?

Preview Guestbook

Upon downloading files the guestbook asks for the following information.

Guestbook Name

Collected Data

Account Information

Package File Download

Use the Download URL in a Wget command or a download manager to download this package file. Download via web browser is not recommended. User Guide - Downloading a Dataverse Package via URL

Download URL

https://dataverse.ada.edu.au/api/access/datafile/

Compute Batch

Clear Batch

Dataset	Persistent Identifier	Change Compute Batch

Compute Batch

Submit for Review

You will not be able to make changes to this dataset while it is in review.

Publish Dataset

Are you sure you want to republish this dataset?

Select if this is a minor or major version update.

Minor Release (1.2)

Major Release (2.0)

Publish Dataset

This dataset cannot be published until PIA Synthetic Data Dataverse is published by its administrator.

Publish Dataset

This dataset cannot be published until PIA Synthetic Data Dataverse and ADA Dataverse are published.

Return to Author

Return this dataset to contributor for modification. The reason for return entered below will be sent by email to the author.

Curation Status History

Status	Date	Assigner
No records found.

Add/Edit a Version Note

Styled Citation