VT Census Case Studies: Arlington County: CPHD data

Brief Overall Description of the Dataset:

CPHD data (ATRACK, Development Tracking, Forecast Round 8.4, and apartment data compiled for a separate project) was sent via FTP.

ATRACK data includes information on committed affordable units (CAFs). The 2013 ATRACK data, which tracks rental apartments, is incorporated into the "Housing count data." The Housing Units data was put together for a separate project on projecting school attendance rates (more families are living in rental units, which are not included in the algorithms used to project student numbers, so those algorithms underpredict).

Will send community features for CAFs.

Link: http://departments.arlingtonva.us/planning-housing-development/

Date Inventory Completed: 7/15/2015

Screening

  • Is the data collected opinion-based?
  • Is the data collection recurring (must be collected at least annually)?
  • Is there data available for 2013?
  • Is the data collected at the property or housing unit level?
  • Can we access the data by August 15th?

Purpose

  • What is the purpose of the organization collecting the data?

The Department of Community Planning, Housing, and Development (CPHD) is responsible for the development review process; comprehensive planning; zoning administration; inspections and code enforcement and data analysis for Arlington County. The department is responsible for planning both in Arlington’s neighborhoods and in the densely developed, transit oriented Metro corridors. CPHD is the lead department in implementing the County’s Smart Growth planning vision.

  • Why is it collected and how does the organization use it?

The data is collected for tax, fees, and/or planning purposes, depending on the specific data.

The ATRACK survey is conducted to find out rent by bedroom size and to track the stock of affordable units in the county (reported on annually), i.e., units renting at 60% and 50% of median income.

  • Who else uses the data?

In raw form, the data is used throughout the county. The data also forms the basis of other commercial and federal databases.

  • Who do they sell the data to?

Commercial and nonprofit entities that repurpose the data to create their own databases or products.
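
The 60% and 50% income thresholds tracked by the ATRACK survey can be turned into maximum monthly rents with simple arithmetic. The sketch below is only an illustration. It assumes the common rule of thumb that a unit is affordable when rent is at most 30% of gross household income, and the median income figure is a made-up placeholder rather than an Arlington value. Neither assumption comes from CPHD, and the function names are hypothetical.

```python
# Illustrative sketch only; this is not CPHD's documented method.
# ASSUMPTION: rent is "affordable" when it is at most 30% of gross income.
AFFORDABLE_SHARE_OF_INCOME = 0.30


def max_affordable_rent(median_income: float, income_share: float) -> float:
    """Monthly rent affordable to a household earning `income_share`
    (e.g. 0.60) of the annual median income."""
    annual_income = median_income * income_share
    return annual_income * AFFORDABLE_SHARE_OF_INCOME / 12


def affordability_tier(monthly_rent: float, median_income: float) -> str:
    """Classify a unit by the lowest income tier that can afford its rent."""
    if monthly_rent <= max_affordable_rent(median_income, 0.50):
        return "50% of median income"
    if monthly_rent <= max_affordable_rent(median_income, 0.60):
        return "60% of median income"
    return "above 60% of median income"


# ASSUMPTION: placeholder median income, not an actual Arlington figure.
PLACEHOLDER_MEDIAN_INCOME = 100_000.0

if __name__ == "__main__":
    for rent in (1200.0, 1400.0, 1600.0):
        print(rent, affordability_tier(rent, PLACEHOLDER_MEDIAN_INCOME))
```

In practice the thresholds would also vary by household size and bedroom count, which this sketch ignores.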


Method

  • What is the data collection method? 

ATRACK is self-reported data collected through a survey.

Housing Unit Count, Development Tracking, and Forecast Round 8.4 are put together by CPHD researchers.

  • What is the type of data collected? 

Administrative

  • If designed, who created the questions?

  • What is the raw source of the collected data (prior to any aggregation)? 

Varies by source (e.g., self-reported rent information)


Description

  • What is the general topic of the data (1-2 words)?

Development, housing stock, planning

  • What are the earliest and latest dates for which data is available?

2015 data set for ATRACK. Franklin will look to see if historical data can be easily pulled (if so, we will get 2009-2013 data; if not, just 2013 data).

  • Is data collected and available periodically?

Yes; frequency varies by database.

  • How soon after a reference period ends can a data source be prepared and provided? 

Unknown


Selectivity

  • What is the universe (e.g., population) that the data represents?

Housing units in Arlington


Accessibility

  • How is the data accessed? 

Varies.

Housing Unit Count, Development Tracking, and Forecast Round 8.4 were transferred through the SDAL FTP.

ATRACK will require asking for exactly what we need (i.e., a codebook and the specific variables).

The Permitting System will require an explicit yet comprehensive request in the first round. (We were told this initially, but then found an API.)

  • Is it open data?

The databases are put together by the CPHD for their own projects and/or for contractors. The data itself is public, but not available online.

  • Any legal, regulatory, or administrative restrictions on accessing the data source?

Nothing stated explicitly. As agents of the state, we must protect the data as other county agencies do.

  • Cost? - One time or annual or project based payment?

Free

Does this dataset appear to meet our needs for the Census study? YES

Full Inventory

Description

  • Features
    • What is the temporal nature of the data: longitudinal, time-series, or one time point?

Longitudinal

    • Geospatial? If Yes, at what level?

Varies: housing address, parcel, or to census block/tract


Metadata

  • Is there information available to assess the transparency and soundness of the methods to gather the data for our purposes?

No

  • Is there a description of each variable in the source along with their valid values?

Yes

  • Are there unique IDs for unique elements that can be used for linking data?

Yes

  • Is there a data dictionary or codebook?

Codebook was emailed along with the data for Housing Unit, Alterations and Additions, and Development Tracking data

 

Selectivity

  • What unit is represented at the record level of the data source?

Varies by database.

  • Does this universe match the stated intentions for the data collection? If not, what has been included or excluded and why?

ATRACK includes only those complexes that responded to the survey and self-reported information.

  • What is the sampling technique used (if applicable)? 

  • What was the coverage?

Unknown


Stability/Coherence

  • Were there any changes to the universe of data being captured (including geographical areas covered) and if so what were they?

Unknown

  • Were there any changes in the data capture method and if so what were they? 

Unknown

  • Were there any changes in the sources of data and if so what were they? 

Unknown


Accuracy

  • Any known sources of error?

Unknown

  • Describe any quality control checks performed by the data’s owner.

Data gathered by the CPHD to create their own databases and measures are cleaned and made uniform by CPHD researchers.

 

Accessibility

  • Any records or fields collected but not included in the data source (such as for confidentiality reasons)?

Unknown

  • Is there a subset of variables and/or data that must be obtained through a separate process? If yes, are there separate legal, regulatory, or administrative restrictions on accessing the data source? Cost? - One time or annual or project based payment?

Unknown


Privacy and security

  • Was consent given by participant? If so, how was consent given?

Unknown

  • Are there legal limitations or restrictions on the use of the data? 

Not stated

  • What confidentiality policies does the source have? 

As agents of the county, we need to treat the data as any other county employee would with regard to confidentiality and security.

 

Research

  • What research has been done with this dataset? (e.g., impact of policies, predictors of student success)

Development, affordable housing, zoning, community facilities, visions.

  • Include any links to research if provided:

See: http://departments.arlingtonva.us/planning-housing-development/

http://projects.arlingtonva.us/data-research/

  • List any other data use notes provided by the supplier.

 

Gaps/Concerns

  • Feasibility - can all jurisdiction levels provide the data (if applicable)?
  • Data ownership - is legal guidance unclear because it is unclear who owns digital data?
  • Data collection authority - what data is reasonably private and what constitutes unwarranted intrusion?
  • Describe any other notes you have or any gaps/concerns you see with this dataset: