VT Census Case Studies : Real Capital Analytics

Brief Overall Description of the Dataset:

Real Capital Analytics records investment activity for the commercial real estate industry, looking mostly at apartments, hotels, offices and retail locations. Their report of repeat-sales regression indices for commercial real estate is referred to as the Moody’s/REAL CPPI. In 2015, RCA began to quantify the price value of walkability for commercial properties in the RCA & Walk Score CPPI.  These datasets look at the distance and walkability from apartments and hotels to places of school, work and entertainment, in comparison to housing cost.

Link: https://www.rcanalytics.com/Public/rca_cppi.aspx

Date Inventory Completed: 05/22/2015

Screening

  • Is the data collected opinion-based?
  • Is the data collection recurring (must be collected at least annually)?
  • Is there data available for 2013?
  • Is the data collected at the property or housing unit level?
  • Can we access the data by August 15th?

Purpose

  • What is the purpose of the organization collecting the data?

The purpose of collecting this data is to track trends in the commercial property market across the country.

  • Why is it collected and how does the organization use it?

The data is collected in order to allow lenders and investors best navigate the large commercial property market across the country.

  • Who else uses the data?

Investors, lenders, businesses

  • Who do they sell the data to?

Investors, lenders, businesses

 

Method

  • What is the data collection method? 

Data is collected from other third party databases, which are not listed on their website.

  • What is the type of data collected? 

Administrative data

  • If designed, who created the questions?

  • What is the raw source of the collected data (prior to any aggregation)? 

Researchers had linked with other databases to create their reports.


Description

  • What is the general topic of the data (1-2 words)?

Commercial real-estate prices 

  • What are the earliest and latest dates for which data is available?

December 2000 - April 2015

  • Is data collected and available periodically?

monthly

  • How soon after a reference period ends can a data source be prepared and provided? 

Approximately a month


Selectivity

  • What is the universe (e.g., population) that the data represents?

Commercial, multifamily housing and real estate in the United States


Accessibility

  • How is the data accessed? 

  • Is it open data?

Not all, some basic overview data is available on the website 

  • Any legal, regulatory, or administrative restrictions on accessing the data source?

None listed

  • Cost? - One time or annual or project based payment?

For entire United States (local unknown) one year subscription- $25,000 for a full year subscription which includes the monthly reports, and an extra $35,000 for the actual raw data to work with.


Does this dataset appear to meet our needs for the Census study? YES

Explanation:

This dataset adds a deeper level of insight into the large scale commercial real-estate market, rather than the single-family home market. The data will be available as it is needed, and will be up to date.

Full Inventory

Description

  • What is the general contents of the data source?

Volumes of properties closed/ in contract (in $mil), number of properties sold, average price of sold properties

  • Features
    • What is the temporal nature of the data: longitudinal, time-series, or one time point?

Time-series

    • Geospatial? If Yes, at what level?

Yes, at a state level


Metadata

  • Is there information available to assess the transparency and soundness of the methods to gather the data for our purposes?

“While the transparency of investment markets and availability of information varies greatly by country and market, RCA works hard to capture all relevant information possible. The information available through RCA is far more comprehensive than anything currently available in the public domain. Although mistakes and omissions do occur, the information is updated and revised continuously.”Is there a description of each variable in the source along with their valid values?

  • Are there unique IDs for unique elements that can be used for linking data?

No

  • Is there a data dictionary or codebook?

No

 

Selectivity

  • What unit is represented at the record level of the data source? 

Property

  • Does this universe match the stated intentions for the data collection? If not, what has been included or excluded and why?

Uknown

  • What is the sampling technique used (if applicable)?

RCA research is concentrated on commercial property and portfolio sales of $2.5 million or greater in the US, and $10 million or greater outside of the US.” Other than that, none listed.

  • What was the coverage?

Not listed


Stability/Coherence

  • Were there any changes to the universe of data being captured (including geographical areas covered) and if so what were they?

None

  • Were there any changes in the data capture method and if so what were they? (e.g., revised questions, data collection mode, classification categories, algorithms for social media data)

None

  • Were there any changes in the sources of data and if so what were they? 

None

 

Accuracy

  • Any known sources of error?

None listed

  • Describe any quality control checks performed by the data’s owner.

Update and revise for error continuously

 

Accessibility

  • Any records or fields collected, but not included in data source, such as for confidentiality reasons)? 

None listed

  • Is there a subset of variables and/or data that is must be obtained through a separate process?? If yes, is there a separate legal, regulatory, or administrative restrictions on accessing the data source? Cost? - One time or annual or project based payment?

None listed

 

Privacy and security

  • Was consent given by participant? If so, how was consent given?

Yes, all data was retrieved properly from the outside sources

  • Are there legal limitations or restrictions on the use of the data? 

None listed

  • What confidentiality policies does the source have? 

None listed

 

Research

  • What research has been done with this dataset? 

None listed

  • Include any links to research if provided:
  • List any other data use notes provided by the supplier.

 

Gaps/Concerns

  • Feasibility - can all jurisdiction levels provide the data (if applicable)?
  • Data ownership - a lack of clarity in legal guidance stemming from a lack of clarity with who owns digital data?
  • Data collection authority - what data is reasonably private and what constitutes unwarranted intrusion?
  • Describe any other notes you have or any gaps/concerns you see with this dataset: