VT Census Case Studies : Pennsylvania Workforce

Variables included in the Dataset:

K-12

  • Demographics
  • School
  • District
  • Attendance/Dropout
  • Grades/GPA
  • Courses
  • Test scores
  • ACT/SAT
  • Disciplinary action
  • Teacher info (salaries, etc.)

Higher Education

  • Demographics
  • Type of college
  • Enrollment
  • Courses
  • Major
  • Grades
  • Tuition/Scholarships

Workforce

  • Demographics
  • Salary
  • Industry

Link: https://paworkstats.geosolinc.com/vosnet/Default.aspx

 

Date Inventory Completed: 5/22/15

Screening

  • Is there data available for 2013?
  • Can we access the data by August 15th?

Does this dataset appear to meet our needs for the Census study? UNDECIDED

ExplanationData is mostly from the Bureau of Labor Statistics. Does not include individual-level information.

Full Inventory

Purpose

  • What is the purpose of the organization collecting the data?

To collect information on the economic status of Pennsylvania’s labor market.

  • Who else uses the data? (make a note if they sell the data to companies)

Businesses, policy makers, potential employees


Description

  • What is the general topic of the data?
  • K-12 Student Information
  • Higher Education Student Information
  • Workforce Information
  • Longitudinal Education Information (includes K-12, Higher Ed, and/or Workforce)

 

  • What are the earliest and latest dates for which data is available?

Varies based on the dataset.

  • How soon after a reference period ends can a data source be prepared and provided?

Unknown

 

Method

  • What is the data collection method (portal, other)?

Unclear

  • What is the raw source of the collected data (teacher, superintendent)?

Varies, some from Bureau of Labor Statistics, some algorithms to create datasets from job listings, other sources too.


Selectivity (conversely, the representativeness)

  • What is the universe (e.g., population) that the data represents?

All individuals in the Pennsylvania workforce


Stability/Coherence

  • Note any changes to the universe of data being captured (e.g., including private schools).

Unknown

  • Note any changes to the data capture method or sources of data.

Unknown

 

Metadata

  • Is there a description of each variable in the source along with their valid values? 

No

  • Are there unique IDs for unique elements that can be used for linking data? 

Unknown

  • Can K-12 be linked to higher ed or higher ed to workforce? 

Unknown

  • Links to codebooks: 

N/A

 

Accuracy

  • Any known sources of error?

Unknown

  • Describe any quality control checks performed by the state (or data manager).

Unknown

 

Accessibility

  • How is the school-level data accessed (note if it needs to be screen scraped)?

Only aggregated data is available

  • How is the student-level data accessed?

Raw, individual-level data is not available.

  • Note if IRB is needed or any other restrictions on accessing data.

N/A

  • Any records or fields collected, but not included in data source? 

N/A

  • Cost? - One time or annual or project based payment? 

N/A


Privacy and security

  • Note any confidentiality policies or legal limitations other than FERPA: 

N/A

  • What do they consider personally identifiable information? 

Unclear


Research

  • What research has been done with this dataset? 

Unknown

  • Research links:

N/A

Describe any other notes you have or any gaps/concerns you see with this dataset: Unemployment data comes from the BLS, meaning that participants were likely involved in a standard unemployment survey.