Brief Overall Description of the Dataset:
Data on mortgages acquired by Fannie Mae and Freddie Mac. The Public Use Database (PUDB) single-family data set includes detailed information such as the income, race, and gender of the borrower as well as the census tract location of the property, loan-to-value ratio, age of mortgage note, and affordability of the mortgage. The PUDB multifamily property-level data set includes information on the size of the property, unpaid principal balance, and type of seller/servicer from which the Enterprise acquired the mortgage. The multifamily unit-class file also includes information on the number and affordability of the units in the property. Both the single-family and multifamily data include indicators of whether the purchases are from “underserved” census tracts, as defined in terms of median income and minority percentage of population.
Link: http://www.fhfa.gov/DataTools
http://www.fhfa.gov/DataTools/Downloads/Pages/Public-Use-Databases.aspx
Date Inventory Completed: 7/11/2015
Screening
- Is the data collected opinion-based?
- Is the data collection recurring (must be collected at least annually)?
- Is there data available for 2013?
- Is the data collected at the property or housing unit level?
- Can we access the data by August 15th?
Purpose
What is the purpose of the organization collecting the data?
The Federal Housing Finance Agency is an independent federal agency. It regulates Fannie Mae, Freddie Mac, and the 12 Federal Home Loan Banks (FHLBanks, or FHLBank System).
Why is it collected and how does the organization use it?
To provide information concerning the flow of mortgage credit and capital in America’s communities
Who else uses the data?
Mortgage lenders, planners, researchers, and housing advocates
Who do they sell the data to?
Free
Method
What is the data collection method?
Operator entry
What is the type of data collected?
Administrative
If designed, who created the questions?
What is the raw source of the collected data (prior to any aggregation)?
Mortgage filings
Description
What is the general topic of the data (1-2 words)?
Freddie Mac and Fannie Mae Mortgages
What are the earliest and latest dates for which data is available?
2008-2013
Is data collected and available periodically?
Yes
How soon after a reference period ends can a data source be prepared and provided?
Not stated
Selectivity
What is the universe (e.g., population) that the data represents?
Accessibility
- How is the data accessed?
Mortgages acquired by Fannie Mae and Freddie Mac
- Is it open data?
Yes
Any legal, regulatory, or administrative restrictions on accessing the data source?
None stated
Cost? - One time or annual or project based payment?
Free
Does this dataset appear to meet our needs for the Census study? Yes
Full Inventory
Description
- Features
- What is the temporal nature of the data: longitudinal, time-series, or one time point?
Time-series
- Geospatial? If Yes, at what level?
Census tract
Metadata
- Is there information available to assess the transparency and soundness of the methods to gather the data for our purposes?
No
- Is there a description of each variable in the source along with their valid values?
Yes
- Are there unique IDs for unique elements that can be used for linking data?
Yes
- Is there a data dictionary or codebook?
Selectivity
What unit is represented at the record level of the data source?
Mortgage
Does this universe match the stated intentions for the data collection? If not, what has been included or excluded and why?
Unknown
What is the sampling technique used (if applicable)?
What was the coverage?
Unknown
Stability/Coherence
- Were there any changes to the universe of data being captured (including geographical areas covered) and if so what were they?
Unknown
- Were there any changes in the data capture method and if so what were they?
Unknown
- Were there any changes in the sources of data and if so what were they?
Unknown
Accuracy
- Any known sources of error?
Unknown
- Describe any quality control checks performed by the data’s owner.
Unknown
Accessibility
- Any records or fields collected but not included in the data source (such as for confidentiality reasons)?
Unknown
- Is there a subset of variables and/or data that must be obtained through a separate process? If yes, are there separate legal, regulatory, or administrative restrictions on accessing the data source? Cost? - One time or annual or project based payment?
No
Privacy and security
- Was consent given by participant? If so, how was consent given?
Not stated (possibly in mortgage documents)
- Are there legal limitations or restrictions on the use of the data?
None stated
- What confidentiality policies does the source have?
None stated
Research
- What research has been done with this dataset? (e.g., impact of policies, predictors of student success)
None stated
- Include any links to research if provided:
- List any other data use notes provided by the supplier.
Gaps/Concerns
- Feasibility - can all jurisdiction levels provide the data (if applicable)?
- Data ownership - is legal guidance unclear because it is unclear who owns the digital data?
- Data collection authority - what data is reasonably private and what constitutes unwarranted intrusion?
- Describe any other notes you have or any gaps/concerns you see with this dataset: