Variables included in the Dataset:
K-12
- Demographics
- School
- District
- Attendance/Dropout
- Grades/GPA
- Courses
- Test scores
- ACT/SAT
- Disciplinary action
- Teacher info (salaries, etc.)
Higher Education
- Demographics
- Type of college
- Enrollment
- Courses
- Major
- Grades
- Tuition/Scholarships
Workforce
- Demographics
- Salary
- Industry
Screening
- Is there data available for 2013?
- Can we access the data by August 15th?
Does this dataset appear to meet our needs for the Census study? NO
Explanation: Connecticut data can be linked between K-12, higher ed and workforce, but access to this linked information is very limited. "Data Requests must be for the purpose of conducting an audit or evaluation of a publicly funded education program 34 C.F.R. 99.1 and be of benefit to a Local or State Education Authority or Agency in order to be approved. Requests must also be in compliance with limitations imposed by state and federal law with regard to education and unemployment insurance data. For example, unit record unemployment wage data may only be supplied to public officials." Therefore it appears that outside researchers are not able to access the linked data.
Full Inventory
Purpose
- What is the purpose of the organization collecting the data?
“The Connecticut State Department of Education (CSDE) collects data from public school districts and other education providers to meet reporting requirements, distribute funding, guide policy, inform accountability, facilitate data use, and report to the public with the ultimate goal of improving educational outcomes for all students.”
- Who else uses the data? (make a note if they sell the data to companies)
Parents, policy-makers, researchers
Description
- What is the general topic of the data?
- K-12 Student Information
- Higher Education Student Information
- Workforce Information
- Longitudinal Education Information (includes K-12, Higher Ed, and/or Workforce)
- What are the earliest and latest dates for which data is available?
2004-2013
- How soon after a reference period ends can a data source be prepared and provided?
At least one year.
Method
What is the data collection method (portal, other)?
portal
What is the raw source of the collected data (teacher, superintendent)?
school administrator
Selectivity (conversely, the representativeness)
- What is the universe (e.g., population) that the data represents?
Students attending public schools. Not clear if this includes charter and private schools.
Stability/Coherence
Note any changes to the universe of data being captured (e.g., including private schools).
Unknown
Note any changes to the data capture method or sources of data.
Unknown
Metadata
Is there a description of each variable in the source along with their valid values?
Yes
Are there unique IDs for unique elements that can be used for linking data?
Yes
Can K-12 be linked to higher ed or higher ed to workforce?
Yes, however, it is not available to researchers.
Links to codebooks:
http://www.ct.edu/initiatives/p20win-dictionary
Accuracy
Any known sources of error?
Unknown
Describe any quality control checks performed by the state (or data manager).
Unknown
Accessibility
- How is the school-level data accessed (note if it needs to be screen scraped)?
Via Excel spreadsheets (http://sdeportal.ct.gov/Cedar/WEB/ct_report/DTHome.aspx) as well as through online customizable reports that can be screen-scraped: http://cmt3.cmtreports.com/iReport/Default.aspx
- How is the student-level data accessed?
Submit a data request form. The time needed to fulfill requests is dependent on the complexity of the request. CSDE states that they do not release Personal and Identifiable Information (PII) as part of routine data requests. It is not clear how they define PII. In rare instances, the CSDE may choose to engage in a formal FERPA-compliant legal agreement with another organization for research and evaluation purposes in which case PII may be released.
- Note if IRB is needed or any other restrictions on accessing data.
N/A
- Any records or fields collected, but not included in data source?
Removes information where the cell count is less than or equal to five.
- Cost? - One time or annual or project based payment?
There may be a cost for fulfilling requests that require substantial hours to fulfill.
Privacy and security
Note any confidentiality policies or legal limitations other than FERPA:
NA
What do they consider personally identifiable information?
Any information that could identify a particular student
Research
What research has been done with this dataset?
School performance
Research links:
http://sdeportal.ct.gov/Cedar/WEB/ResearchandReports/ResearchReports.aspx
Describe any other notes you have or any gaps/concerns you see with this dataset: