Variables included in the Dataset:
K-12
- Demographics
- School
- District
- Attendance/Dropout
- Grades/GPA
- Courses
- Test scores
- ACT/SAT
- Disciplinary action
- Teacher info (salaries, etc.)
Higher Education
- Demographics
- Type of college
- Enrollment
- Courses
- Major
- Grades
- Tuition/Scholarships
Workforce
- Demographics
- Salary
- Industry
Screening
- Is there data available for 2013?
- Can we access the data by August 15th? Not clear yet: the data request and review process takes “several weeks.”
Does this dataset appear to meet our needs for the Census study? NO
Explanation:
The OhioAnalytics administration has data for K-12, higher education, and workforce. The K-12 data does not link to higher education or workforce; however, higher education and workforce data can be linked. Spoke to staff member Lisa Neilson at the Ohio Education Research Center, which houses the data. She indicated that data for the 2012-2013 school year, and anything more recent, could not be delivered by August 15th because it has not yet been archived.
Full Inventory
Purpose
- What is the purpose of the organization collecting the data?
“The Workforce Data Quality Initiative (WDQI) is an endeavor to develop longitudinal data covering workforce and education systems while serving as a resource for analysis and research.”
- Who else uses the data? (make a note if they sell the data to companies)
Mostly researchers
Description
- What is the general topic of the data? "comprehensive, longitudinal data system"
- K-12 Student Information
- Higher Education Student Information
- Workforce Information
- Longitudinal Education Information (includes K-12, Higher Ed, and/or Workforce)
- What are the earliest and latest dates for which data is available?
Earliest date unknown (staff have been emailed). Latest date is 2014.
- How soon after a reference period ends can a data source be prepared and provided?
Varies depending on the research question and the extent of the data being requested. However, it takes at least several months, and up to one year, for data to be archived after the reference period.
Method
- What is the data collection method (portal, other)?
Portal
- What is the raw source of the collected data (teacher, superintendent)?
Unknown
Selectivity (conversely, the representativeness)
- What is the universe (e.g., population) that the data represents?
Public schools in Ohio
Stability/Coherence
- Note any changes to the universe of data being captured (e.g., including private schools).
Unknown
- Note any changes to the data capture method or sources of data.
Unknown
Metadata
- Is there a description of each variable in the source along with their valid values?
Yes
- Are there unique IDs for unique elements that can be used for linking data?
All personally identifying information is replaced with random pseudo-identifiers. It is not specified whether these identifiers are maintained between primary education, higher education, and workforce.
- Can K-12 be linked to higher ed or higher ed to workforce?
Yes
Links to codebooks: https://www.chrr.ohio-state.edu/investigator/pages/search.jsp#
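Illustration only: if the pseudo-identifiers turn out to be shared between the higher education and workforce files (not confirmed above), linking could look roughly like the sketch below. The field names (`pseudo_id`, `major`, `industry`) and the sample records are invented for the example and are not taken from the Ohio codebooks.

```python
# Hypothetical records; field names and pseudo-ID values are invented.
higher_ed = [
    {"pseudo_id": "a91f", "major": "Nursing"},
    {"pseudo_id": "c27d", "major": "Engineering"},
]
workforce = [
    {"pseudo_id": "a91f", "industry": "Health care"},
    {"pseudo_id": "e555", "industry": "Retail"},
]


def link_on_pseudo_id(left, right, key="pseudo_id"):
    """Inner-join two lists of dicts on a shared pseudo-identifier."""
    # Index the right-hand records by identifier for constant-time lookup.
    right_by_id = {row[key]: row for row in right}
    linked = []
    for row in left:
        match = right_by_id.get(row[key])
        if match is not None:
            # Keep only people who appear in both files.
            linked.append({**row, **match})
    return linked


linked = link_on_pseudo_id(higher_ed, workforce)
```

Records that appear in only one file are dropped here; a real analysis would also want to count those unmatched records to gauge linkage coverage.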
Accuracy
- Any known sources of error?
Unknown
- Describe any quality control checks performed by the state (or data manager).
Unknown
Accessibility
- How is the school-level data accessed (note if it needs to be screen scraped)?
Must be requested, as with the student-level data
- How is the student-level data accessed?
There is an extensive data request procedure that requires background on the researcher and research team members, a detailed research plan including the specific research variables, a data security plan, and a signed OERC and/or OWDQL data sharing agreement.
- Note if IRB is needed or any other restrictions on accessing data.
Yes
- Any records or fields collected, but not included in data source?
N/A
- Cost? - One time or annual or project based payment?
Cost is project based and is dependent on the availability of the requested data, the complexity of the research design, and the extent to which CHRR staff are involved in preparing, matching, and analyzing the data.
Privacy and security
- Note any confidentiality policies or legal limitations other than FERPA:
HIPAA
- What do they consider personally identifiable information?
Student names
Research
- What research has been done with this dataset?
Much research has been done with Ohio Analytics data. The link below provides the name, description, and data sets used for each project. Some examples of these projects are academic outcomes of dual enrollment students, course trajectories of STEM community college students, and “Straight A” outcomes.
- Research links: http://www.ohioanalytics.gov/Reports/Project-Status-Report.stm
Describe any other notes you have or any gaps/concerns you see with this dataset: On 7/14/15, received email from Lisa Nelson explaining "we have in-house to link the primary education records to other data sets."