Profiling

Codebook created: the data came with no documentation, so a codebook had to be written before profiling could proceed.

Each variable was profiled for quality (completeness, validity, consistency, and uniqueness). The results are documented in the codebook.
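The per-variable checks can be sketched in a few lines of Python. This is a minimal illustration, not the procedure actually used for the codebook: the function name `profile_column`, its parameters, and the sample values are all assumptions. Consistency is represented here only through the `missing_codes` argument, which records how missing values are coded for the column.

```python
from collections import Counter

def profile_column(values, missing_codes=(" ", ""), allowed=None):
    """Summarise one column: completeness, validity, and uniqueness."""
    total = len(values)
    # Values equal to a missing code (e.g. " ") count as missing.
    present = [v for v in values if v not in missing_codes]
    levels = Counter(present)
    # Validity: share of non-missing values found in the allowed set, if one is given.
    if allowed is None or not present:
        validity = None
    else:
        validity = sum(v in allowed for v in present) / len(present)
    return {
        "n": total,
        "missing": total - len(present),
        "completeness": len(present) / total if total else None,
        "validity": validity,
        "n_levels": len(levels),
        "levels": sorted(levels),
    }

# Hypothetical sample values for a direction field such as PRE_DIR.
sample = ["N", " ", "S", "E", " ", "W", "N", "X"]
report = profile_column(sample, allowed={"E", "N", "S", "W"})
print(report)
```

In this made-up sample, two of eight values are missing and "X" is the only entry outside the allowed levels.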

Overall Data Description

  • Unit of observation: Parcel
  • Number of observations:  29,429

The tables below contain the results of the data profiling for key variables. For more detail, and for the profiling of all variables, see the codebook. The data came in one large table, which was then separated into the tables below.
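One way to separate a single wide table into thematic tables, while checking that the parcel identifier is not duplicated, is sketched below. The grouping in `GROUPS` is abbreviated, and the function names and sample rows are hypothetical. Only the column names and the use of PIN as the key come from the tables in this section.

```python
from collections import Counter

# Hypothetical, abbreviated column groups; names follow the tables in this section.
GROUPS = {
    "location": ["ADDR_NUM", "PLAINST"],
    "characteristics": ["YrBuilt"],
    "sale_history": ["Sale1Amt"],
}

def duplicated_keys(rows, key="PIN"):
    """Return key values that occur on more than one row."""
    counts = Counter(row[key] for row in rows)
    return sorted(k for k, n in counts.items() if n > 1)

def split_table(rows, groups, key="PIN"):
    """Build one table per group, keeping the key so the tables can be rejoined."""
    return {
        name: [{key: row[key], **{col: row.get(col) for col in cols}} for row in rows]
        for name, cols in groups.items()
    }

# Made-up rows for illustration only.
rows = [
    {"PIN": "A1", "ADDR_NUM": "101", "PLAINST": "MAIN", "YrBuilt": "1,985", "Sale1Amt": "0"},
    {"PIN": "A2", "ADDR_NUM": " ", "PLAINST": "OAK", "YrBuilt": "0", "Sale1Amt": "125,000"},
]

print(duplicated_keys(rows))
tables = split_table(rows, GROUPS)
print(tables["location"])
```

Carrying PIN into every sub-table keeps the split reversible, since each table can be joined back to the others on that key.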

Quality: Location

| Duplications | No duplication |
| PIN | No duplication |

| Variables | Completeness | Validity | Uniqueness | Consistency |
|---|---|---|---|---|
| ADDR_NUM | 128 missing | 100% | | NA's are coded as " " |
| PRE_DIR | | 100% | Levels: "E" "N" "S" "W" | NA's are coded as " " |
| PLAINST | 125 missing | 100% | 1,485 levels | NA's are coded as " " |
| STREETTYPE | 5,989 missing | 100% | 63 levels | NA's are coded as " " |
| SUF_DIR | | 100% | Levels: "E" "N" "S" "W" | NA's are coded as " " |
| CITYL | 100% | Two are cities while the other is JCC | Levels: "JAMC" "LAN" "TNO" | NA's are coded as " " |
| Census_Tr | 100% | Coding does not match traditional GeoID for tracts | 11 levels | 100% |

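Two of the findings in the Location table lend themselves to a short check: blanks coded as " " and tract codes that do not follow the standard GeoID layout. The sketch below assumes the standard census tract GEOID of 11 digits (2-digit state, 3-digit county, 6-digit tract). The helper names and the sample record, including its tract value, are made up for illustration.

```python
import re

# Standard census tract GEOID: 2-digit state + 3-digit county + 6-digit tract.
TRACT_GEOID = re.compile(r"[0-9]{11}")

def blank_to_none(value):
    """Map blank or whitespace-only strings to None; leave other values unchanged."""
    if isinstance(value, str) and value.strip() == "":
        return None
    return value

def is_tract_geoid(value):
    """True when the value is a string of exactly 11 digits."""
    return isinstance(value, str) and TRACT_GEOID.fullmatch(value) is not None

# Hypothetical record; the Census_Tr value is an invented non-GEOID code.
record = {"ADDR_NUM": "101", "PRE_DIR": " ", "STREETTYPE": " ", "Census_Tr": "801.01"}
cleaned = {name: blank_to_none(value) for name, value in record.items()}

print(cleaned)
print(is_tract_geoid(cleaned["Census_Tr"]))
```

Converting " " to an explicit missing value before counting keeps completeness figures from treating blanks as real entries.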
Quality: Characteristics

| Duplications | No duplication |
| PIN | No duplication |

| Variables | Completeness | Validity | Uniqueness | Consistency |
|---|---|---|---|---|
| Res_Units | 100% | 100% | | 100% |
| PCDesc | 12 missing | 100% | Levels: "Agricultural 100+ acres", "Agricultural 20-99 acres", "Commercial & Industrial", "Exempt - Educational", "Exempt - Local Govt", "Exempt - Other", "Exempt - Religious", "Exempt - State Govt", "Multi-Family", "SCC Assessed", "Single Family - Suburban" | NA's are coded as " " |
| YrBuilt | 11% missing | Years coded with a "," | | NA's are coded as "0" |
| NumBdRms | 100% | 100% | | 100% |
| Num2Baths | 100% | 100% | | 100% |
| Num3Baths | 100% | 100% | | 100% |
| HeatDesc | 11% missing | Invalid entries (e.g. square footage) | Levels: "Baseboard", "Central Warm Air", "Electric baseboard", "Forced hot air", "Forced hot air-elec", "Forced hot air-gas", "Forced hot air-oil", "Geothermal", "Geothermal or solar", "Gravity-oil", "Heat pump", "Hot water", "Hot water or steam", "No Heat", "No heat space", "No heat-floor unit", "No heat-wood stove/insert", "Other", "Solar Active", "Space heater", "Space heater-elec", "Undefined" | NA's are coded as " " |

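The YrBuilt findings (years coded with a comma, missing values coded as "0") suggest a small parsing step before the variable can be used numerically. The following is a sketch under assumptions: the function name and the sample values are hypothetical, and treating an empty string as missing is an addition beyond what the table states.

```python
def parse_year_built(raw):
    """Return the year as an int, or None when the value is a missing code."""
    text = str(raw).replace(",", "").strip()
    # "0" is the documented missing code; an empty string is assumed missing as well.
    if text in ("", "0"):
        return None
    return int(text)

# Made-up raw values for illustration only.
raw_years = ["1,985", "0", "2,004", "0"]
years = [parse_year_built(value) for value in raw_years]
missing_share = sum(year is None for year in years) / len(years)

print(years)          # [1985, None, 2004, None]
print(missing_share)  # 0.5
```

Values that are neither a missing code nor a number raise `ValueError`, which surfaces invalid entries instead of silently dropping them.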
Quality: Sale History

| Duplications | No duplication |
| PIN | No duplication |

| Variables | Completeness | Validity | Uniqueness | Consistency |
|---|---|---|---|---|
| Sale1D | 5% missing | 1 has year sold as 2041 | | NA's are coded as " " |
| Sale1Amt | 100% | 34% listed at 0; coded with a "," | | 100% |
| Sale2D | 16% missing | 100% | | 100% |
| Sale2Amt | 100% | 42% listed at 0; coded with a "," | | 100% |
| Sale3D | 35% missing | 100% | | 100% |
| Sale3Amt | 100% | 58% listed at 0; coded with a "," | | 100% |
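
The two validity issues in the Sale History table, amounts listed at 0 and a sale year in the future, can be flagged with checks along these lines. This is an illustrative sketch: the function names, the `as_of` cut-off date, and the sample values are assumptions, and the dates are taken to be already parsed because the source does not state the raw date format. Whether a 0 stands for a missing price or a genuine zero-dollar transfer is also not stated in the source.

```python
from datetime import date

def parse_amount(raw):
    """Convert a comma-coded amount such as "125,000" to an int."""
    return int(str(raw).replace(",", ""))

def share_zero(amounts):
    """Share of amounts equal to 0 after parsing."""
    parsed = [parse_amount(amount) for amount in amounts]
    return sum(value == 0 for value in parsed) / len(parsed)

def future_sales(sale_dates, as_of):
    """Return sale dates later than as_of, skipping missing (None) values."""
    return [d for d in sale_dates if d is not None and d > as_of]

# Made-up values for illustration only.
amounts = ["0", "125,000", "0", "310,500"]
sale_dates = [date(1998, 5, 1), None, date(2041, 3, 15)]

print(share_zero(amounts))  # 0.5
print(future_sales(sale_dates, as_of=date(2020, 1, 1)))
```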

Attachments:

JCC Parcel Data Dictionary.docx (application/vnd.openxmlformats-officedocument.wordprocessingml.document)