VT Census Case Studies : American Housing Survey

Brief Overall Description of the Dataset:

This dataset is a collection of survey results on American Housing. The survey is given out every odd year to a random selection of American Households. Operated through the US Census and the U.S. Department of Housing and Urban Development, this survey asks American Households their tenure status, demographics and housing characteristics. Data and summaries are available publicly by August 15, 2015.

Link: http://www.census.gov/programs-surveys/ahs/about.html

Date Inventory Completed: 06/16/2015

Screening

  • Is the data collected opinion-based?
  • Is the data collection recurring (must be collected at least annually)?
  • Is there data available for 2013?
  • Is the data collected at the property or housing unit level?
  • Can we access the data by August 15th?

Purpose

  • What is the purpose of the organization collecting the data?

“To provide a current and continuous series of data on selected housing and demographic characteristics”

  • Why is it collected and how does the organization use it?

“Policy analysts, program managers, budget analysts, and Congressional staff use AHS data to monitor supply and demand, as well as changes in housing conditions and costs, in order to assess housing needs. Analyses based on the AHS are used to advise the executive and legislative branches in the development of housing policies. HUD uses the AHS to improve efficiency and effectiveness and design housing programs appropriate for different target groups, such as first-time home buyers and the elderly. Academic researchers and private organizations also use AHS data in efforts of specific interest and concern to their respective communities.”

  • Who else uses the data?

Businesses, researchers, policy makers

  • Who do they sell the data to?

Anyone


Method

  • What is the data collection method? 

Survey

 See: http://www.census.gov/programs-surveys/ahs/about/methodology.html

  • What is the type of data collected? 

Designed collection

  • If designed, who created the questions?

US Department of Housing and Urban Development

  • What is the raw source of the collected data (prior to any aggregation)? 

Self-report


Description

  • What is the general topic of the data (1-2 words)?

Characteristics and tenure of houses in the US

  •  What are the earliest and latest dates for which data is available?

1973-2015

  • Is data collected and available periodically?

“The AHS is conducted biennially between May and September in odd-numbered years. We collect data between the months of May and September. HUD sometimes adjusts this schedule and/or sample depending on budget constraints.”

  • How soon after a reference period ends can a data source be prepared and provided? 

Unknown


Selectivity

  • What is the universe (e.g., population) that the data represents?

Houses in the US


Accessibility

  • How is the data accessed? 

Download

  • Is it open data?

Yes

  • Any legal, regulatory, or administrative restrictions on accessing the data source?

N/A

  • Cost? - One time or annual or project based payment?

No

 

Does this dataset appear to meet our needs for the Census study? Yes

Full Inventory

Description

  • Features
    • What is the temporal nature of the data: longitudinal, time-series, or one time point?

Longitudinal

    • Geospatial? If Yes, at what level?

Census Tracts

 

Metadata

  • Is there information available to assess the transparency and soundness of the methods to gather the data for our purposes?

Yes, see: http://www.census.gov/programs-surveys/ahs/tech-documentation/ahs-definitions--errors--historical-changes--and-sample-design--.html 

  • Is there a description of each variable in the source along with their valid values?

http://www.census.gov/programs-surveys/ahs/tech-documentation/AHSVarnames_revised.html

  • Are there unique IDs for unique elements that can be used for linking data?

Yes

  • Is there a data dictionary or codebook?

http://www.census.gov/programs-surveys/ahs/tech-documentation.html


Selectivity

  • What unit is represented at the record level of the data source?

Household

  • Does this universe match the stated intentions for the data collection? If not, what has been included or excluded and why?

  • What is the sampling technique used (if applicable)? 

Random

  • What was the coverage?

Unknown

 

Stability/Coherence

  • Were there any changes to the universe of data being captured (including geographical areas covered) and if so what were they?

Census tracts did change over the years, and new metropolitan area shifts occurred. See for other changes: http://www2.census.gov/programs-surveys/ahs/2013/2013%20Historical%20Changes.pdf

  • Were there any changes in the data capture method and if so what were they? 

Yes, until 1985 the survey was collected annually but since then it has only been collected every other year.

  • Were there any changes in the sources of data and if so what were they? 

No

 

Accuracy

  • Any known sources of error?

Potential incorrect recording or wrong answers selected. See: http://www2.census.gov/programs-surveys/ahs/2013/National%20Appendix%20D%202013.pdf and http://www2.census.gov/programs-surveys/ahs/2013/Metropolitan%20Appendix%20D.%20Errors%202013.pdf

  • Describe any quality control checks performed by the data’s owner.

See above links

 

Accessibility

  • Any records or fields collected, but not included in data source, such as for confidentiality reasons)? 

Unknown

  • Is there a subset of variables and/or data that is must be obtained through a separate process? If yes, is there a separate legal, regulatory, or administrative restrictions on accessing the data source? Cost? - One time or annual or project based payment?

No

 

Privacy and security

  • Was consent given by participant? If so, how was consent given?

Yes, by turning in form

  • Are there legal limitations or restrictions on the use of the data? 

No

  • What confidentiality policies does the source have? 

Does not include PII

 

Research

  • What research has been done with this dataset? (e.g., impact of policies, predictors of student success)

Household trends, demographic trends

  • Include any links to research if provided:

http://www.census.gov/programs-surveys/ahs/research/publications.html

  • List any other data use notes provided by the supplier.

Gaps/Concerns

  • Feasibility - can all jurisdiction levels provide the data (if applicable)?
  • Data ownership - a lack of clarity in legal guidance stemming from a lack of clarity with who owns digital data?
  • Data collection authority - what data is reasonably private and what constitutes unwarranted intrusion?
  • Describe any other notes you have or any gaps/concerns you see with this dataset: