VT Census Case Studies : California K-12 Education

Variables included in the Dataset:

K-12

  • Demographics
  • School
  • District
  • Attendance/Dropout
  • Grades/GPA
  • Courses
  • Test scores
  • ACT/SAT
  • Disciplinary action
  • Teacher info (salaries, etc.)

Higher Education

  • Demographics
  • Type of college
  • Enrollment
  • Courses
  • Major
  • Grades
  • Tuition/Scholarships

Workforce

  • Demographics
  • Salary
  • Industry

Link: http://www.cde.ca.gov/ds/

 

Date Inventory Completed: 6/4/15

Screening

  • Is there data available for 2013?
  • Can we access the data by August 15th?

Does this dataset appear to meet our needs for the Census study? UNDECIDED

ExplanationUpdate: Spoke on phone with representatives, student-level data is available for certain variables (could not say which ones due to FERPA) if request process gets approved. They also do not have anyway to link to college data.

Full Inventory

Purpose

  • What is the purpose of the organization collecting the data?

“Data and statistics collected from California schools and learning support resources to identify trends and educational needs and to measure performance.”

  • Who else uses the data? (make a note if they sell the data to companies)

Data is mainly used by the State government for School Accountability Report Cards, Adequate Yearly Progress Reports, Physical Fitness Test Results, etc.


Description

  • What is the general topic of the data?
  • K-12 Student Information
  • Higher Education Student Information
  • Workforce Information
  • Longitudinal Education Information (includes K-12, Higher Ed, and/or Workforce)

 

  • What are the earliest and latest dates for which data is available?

1999-2013

  • How soon after a reference period ends can a data source be prepared and provided?

N/A

 

Method

  • What is the data collection method (portal, other)?

Portal

  • What is the raw source of the collected data (teacher, superintendent)?

N/A

 

Selectivity (conversely, the representativeness)

  • What is the universe (e.g., population) that the data represents?

K-12 for public schools. There is general information collected on K-12 private schools such as their location, programs offered, and enrollment.


Stability/Coherence

  • Note any changes to the universe of data being captured (e.g., including private schools).

Unknown

  • Note any changes to the data capture method or sources of data.

Unknown

 

Metadata

  • Is there a description of each variable in the source along with their valid values?

No

  • Are there unique IDs for unique elements that can be used for linking data?

Yes

  • Can K-12 be linked to higher ed or higher ed to workforce?

No

  • Links to codebooks: 

NA

 

Accuracy

  • Any known sources of error?

Unknown

  • Describe any quality control checks performed by the state (or data manager).

Unknown


Accessibility

  • How is the school-level data accessed (note if it needs to be screen scraped)?

 Data is downloadable as .TXT

  • How is the student-level data accessed?

A preliminary data request form must filled out specifying what data you want, purpose of request, intent to publish, and Research/Evaluation Concept Paper. Requestor will be contact if more information is needed. A MOU will also have to be drafted.

  • Note if IRB is needed or any other restrictions on accessing data.

Depending on the data requested

  • Any records or fields collected, but not included in data source?

If the number of students in a certain cell is small enough that they may be identified, that data will be redacted.

  • Cost? - One time or annual or project based payment?

$65/hour for data preparation

 

Privacy and security

  • Note any confidentiality policies or legal limitations other than FERPA:

California Information Practice Act and California Education Code Section 49062 of the California Constitution

  • What do they consider personally identifiable information?

Student’s name, identification number, address, race, gender, DOB, place of birth, and parent/guardian information.


Research

  • What research has been done with this dataset?

Most research being done is for postsecondary preparedness.


Describe any other notes you have or any gaps/concerns you see with this dataset: