Profiling
Codebook created (Data came with no documentation, so had to create a codebook in order to proceed)
- See codebook here: JCC Parcel Code Book
Each variable profiled for quality (completeness, validity, consistency, and uniqueness). This is documented in the codebook.
Overall Data Description
- Unit of observation: Parcel
- Number of observations: 29,429
The table below contain the results of the data profiling of key variables. To see more details and profiling for all variables, see codebook. The data came in one large table, which was the separated into the different table below.
Quality: Location | ||||
---|---|---|---|---|
Duplications | No duplication | |||
PIN | No duplication | |||
Variables | Completeness | Validity | Uniqueness | Consistency |
ADDR_NUM | 128 missing | 100% | NA's are coded as " " | |
PRE_DIR | 100% | Levels: "E" "N" "S" "W" | NA's are coded as " " | |
PLAINST | 125 Missing | 100% | 1,485 levels | NA's are coded as " " |
STREETTYPE | 5,989 Missing | 100% | 63 Levels | NA's are coded as " " |
SUF_DIR | 100% | Levels: "E" "N" "S" "W" | NA's are coded as " " | |
CITYL | 100% | Two are cities while the other is JCC | Levels: "JAMC" "LAN" "TNO" | NA's are coded as " " |
Census_Tr | 100% | Coding does not match traditional GeoID for tracts | 11 levels | 100% |
Quality: Characteristics | ||||
---|---|---|---|---|
Duplications | No duplication | |||
PIN | No duplication | |||
Variables | Completeness | Validity | Uniqueness | Consistency |
Res_Units | 100% | 100% | 100% | |
PCDesc | 12 missing | 100% | Levels: “Agricultural 100+ acres", "Agricultural 20-99 acres", "Commercial & Industrial", "Exempt - Educational", "Exempt - Local Govt", "Exempt - Other", "Exempt - Religious", "Exempt - State Govt", "Multi-Family" , "SCC Assessed", "Single Family - Suburban" | NA's are coded as " " |
YrBuilt | 11% missing | Years coded with a , | NA's are coded as "0" | |
NumBdRms | 100% | 100% | 100% | |
Num2Baths | 100% | 100% | 100% | |
Num3Baths | 100% | 100% | 100% | |
HeatDesc | 11% missing | Invalid entries (e.g square footage) | Levels: "Baseboard”, "Central Warm Air”, "Electric baseboard”, "Forced hot air", "Forced hot air-elec", "Forced hot air-gas", "Forced hot air-oil", "Geothermal", "Geothermal or solar", "Gravity-oil", "Heat pump”, "Hot water", "Hot water or steam", "No Heat", "No heat space", "No heat-floor unit”, “No heat-wood stove/insert", "Other"” "Solar Active", "Space heater", "Space heater-elec", "Undefined" | NA's are coded as " " |
Quality: Sale History | ||||
---|---|---|---|---|
Duplications | No duplication | |||
PIN | No duplication | |||
Variables | Completeness | Validity | Uniqueness | Consistency |
Sale1D | 5% missing | 1 has year sold as 2041 | NA's are coded as " " | |
Sale1Amt | 100% | 34% listed at 0 Coded with a , | 100% | |
Sale2D | 16% missing | 100% | 100% | |
Sale2Amt | 100% | 42% listed at 0 Coded with a , | 100% | |
Sale3D | 35% missing | 100% | 100% | |
Sale3Amt | 100% | 58% listed at 0 Coded with a , | 100% |
Attachments:
