Brief Overall Description of the Dataset:
RealtyTrac is one of the nation’s largest databases. This data is collected from over 2,200 US counties recording foreclosure data, tax and assessor data, property characteristics, historical sale and loan data, value estimates, local amenities, local hazards, demographics, politics and economics. This dataset has been organized with the goal to better understand neighborhood trends and economics across the United States. Collected mostly from local jurisdictions and outside sources, such as CoreLogic, this dataset provides a picture of the American housing market as well as the demographics across the country. Upon further investigation, it was found that much of the RealtyTrac's data is bought directly from CoreLogic.
Link: http://www.realtytrac.com/
Date Inventory Completed: 05/22/2015
Screening
- Is the data collected opinion-based?
- Is the data collection recurring (must be collected at least annually)?
- Is there data available for 2013?
- Is the data collected at the property or housing unit level? property
- Can we access the data by August 15th?
Purpose
What is the purpose of the organization collecting the data?
This data is collected in order to perform social and housing analyses to better understand different neighborhoods across the United States.
Why is it collected and how does the organization use it?
This data is collected to provide information and analysis on housing and demographics across the United States and is used in multiple fields for research, policy change, modeling and analysis.
Who else uses the data?
Businesses, citizens and researchers
Who do they sell the data to?
Brokers, utilities, realtors, non-profits, investors, insurance, homebuilders, government, etc.
Method
What is the data collection method?
Collect and organize data from third party databases
What is the type of data collected?
Administrative
If designed, who created the questions?
What is the raw source of the collected data (prior to any aggregation)?
Some of the data from this dataset comes from another dataset, CoreLogic, the company also has researchers that collect data directly from source documents of local jurisdictions.
Description
What is the general topic of the data (1-2 words)?
Housing prices
What are the earliest and latest dates for which data is available?
2013-2015
Is data collected and available periodically?
Yes, available monthly
How soon after a reference period ends can a data source be prepared and provided?
About 22 days
Selectivity
What is the universe (e.g., population) that the data represents?
Houses in the United States
Accessibility
How is the data accessed?
It is not listed for the raw data, would find out if attempted to purchase, API is available, but listed for if the raw data was too much.
Is it open data?
No, it has to be purchased
Any legal, regulatory, or administrative restrictions on accessing the data source?
None listed
Cost? - One time or annual or project based payment?
None listed
Does this dataset appear to meet our needs for the Census study? YES
Full Inventory
Description
- What is the general contents of the data source?
Demographics, housing costs, property history, foreclosures, vacancies, property characteristics, value estimates, demographics, economics
- Features
- What is the temporal nature of the data: longitudinal, time-series, or one time point?
Time-series
- Geospatial? If Yes, at what level?
Addresses
Metadata
- Is there information available to assess the transparency and soundness of the methods to gather the data for our purposes?
Not available
- Is there a description of each variable in the source along with their valid values?
Not available
- Are there unique IDs for unique elements that can be used for linking data?
Yes, addresses
- Is there a data dictionary or codebook?
Yes
Selectivity
- What unit is represented at the record level of the data source?
Property
Does this universe match the stated intentions for the data collection? If not, what has been included or excluded and why?
Unknown
What is the sampling technique used (if applicable)?
Not listed
What was the coverage?
90% of the United States population
Stability/Coherence
- Were there any changes to the universe of data being captured (including geographical areas covered) and if so what were they?
No
- Were there any changes in the data capture method and if so what were they?
No
- Were there any changes in the sources of data and if so what were they?
No
Accuracy
- Any known sources of error?
None listed
- Describe any quality control checks performed by the data’s owner.
None listed
Accessibility
- Any records or fields collected, but not included in data source, such as for confidentiality reasons)?
None listed
- Is there a subset of variables and/or data that is must be obtained through a separate process? If yes, is there a separate legal, regulatory, or administrative restrictions on accessing the data source? Cost? - One time or annual or project based payment?
None listed
Privacy and security
- Was consent given by participant? If so, how was consent given?
Data was all purchased by outside source, so the consent was given in the purchase agreement.
- Are there legal limitations or restrictions on the use of the data?
No
- What confidentiality policies does the source have?
None listed
Research
- What research has been done with this dataset?
Economic, social, and real estate analyses
- Include any links to research if provided:
http://www.realtytrac.com/news/realtytrac-reports/
- List any other data use notes provided by the supplier.
N/A
Gaps/Concerns
- Feasibility - can all jurisdiction levels provide the data (if applicable)?
This data is available nationally, and being from outside sources, jurisdictions would easily be able to purchase it as well, if funds were available.
- Data ownership - a lack of clarity in legal guidance stemming from a lack of clarity with who owns digital data?
- Data collection authority - what data is reasonably private and what constitutes unwarranted intrusion?
- Describe any other notes you have or any gaps/concerns you see with this dataset:
For current relationship between RealtyTrac and Core logic, see: https://www.ftc.gov/news-events/press-releases/2014/03/ftc-puts-conditions-corelogic-incs-proposed-acquisition-dataquick