Codebook created
- No data dictionary was provided
Each variable profiled for quality (completeness, validity, consistency, and uniqueness). This is documented in the codebook.
Overall Data Description:
- Year: 2013
- Coverage: Arlington
- Unit of observation: Incident
- No Unique Identifier
Profiling summary
The tables below contain the results of the data profiling of key variables. To see more details and profiling for all variables, see codebook.
Quality: Part 1 | ||||
---|---|---|---|---|
N | 4,083 | |||
Duplications | No duplications | |||
Variables | Completeness | Validity | Uniqueness | Consistency |
"Status | 100% | 100% | Levels: "M" "T" "U" | 100% |
"Score" | 100% | 100% | Levels: "0", "90.97", "100" | 100% |
"Match_type" | 100% | 100% | 100% listed as "A" | 100%100% |
"Match_addr | 683 missing | 100% | 100% | |
"Side" | 683 missing | 100% | Levels: "L" "R" | 100% |
"User_fld" | 683 missing | 100% | 100% listed as "0" | 100% |
"Addr_type" | 683 missing | 100% | 100% listed as "Address" | 100% |
"ARC_street" | 100% | 100% | Some are listed as address, others are listed as an intersection "Addresses are rounded to the nearest 100 block" | 100% |
"ID" | 100% | 100% | 100% Unique (sequence 1:n) | 100% |
"Date" | 100% | 100% | 100% | YYYY-MM-DD |
"Day" | 100% | 100% | Levels: "Fri", "Mon", "Sat", "Sun", "Thu", "Tue", "Wed" | 100% |
"Year" | 100% | 100% | 100% listed as "2013" | 100% |
"Reported_T" | 100% | 100% | Range: 1 - 2400 | 100% |
"Desc_" | 100% | 100% | Levels: "AGGRAVATED ASSAULT", "ALL OTHER LARCENY", "BURGLARY/BREAKING AND ENTERING", "FORCIBLE RAPE", "FROM COIN-OPERATED MACHINE OR DEVICE", "MOTOR VEHICLE THEFT", "POCKET-PICKING", "PURSE-SNATCHING" , "ROBBERY", "SHOPLIFTING", "THEFT FROM BUILDING", "THEFT FROM MOTOR VEHICLE", "THEFT OF MOTOR VEHICLE PARTS OR ACCESSORIE" | 100% |
"Location" | 100% | 100% | Some are listed as address, others are listed as an intersection Matches ARC_street" | 100% |
"coords.x1" | 100% | 17% (683) have invalid coordinate system | 17% (683) have the same invalid coordinate | 100% |
"coords.x1" | 100% | 17% (683) have invalid coordinate system | 17% (683) have the same invalid coordinate | 100% |
Quality: Part 2 | ||||
---|---|---|---|---|
N | 7,492 | |||
Duplications | No duplications | |||
Variables | Completeness | Validity | Uniqueness | Consistency |
"Status | 100% | 100% | Levels: "M" "T" "U" | 100% |
"Score" | 100% | 100% | Levels: "0", "90.97", "100" | 100% |
"Match_type" | 100% | 100% | 100% listed as "A" | 100%100% |
"Match_addr | 2,105 missing | 100% | 100% | |
"Side" | 2,109 missing | 100% | Levels: "L" "R" | 100% |
"User_fld" | 2,109 missing | 100% | 100% listed as "0" | 100% |
"Addr_type" | 2,105 missing | 100% | 100% listed as "Address" | 100% |
"ARC_street" | 100% | 100% | Some are listed as address, others are listed as an intersection "Addresses are rounded to the nearest 100 block" | 100% |
"ID" | 100% | 100% | 100% Unique (sequence 1:n) | 100% |
"Reported_D" | 100% | 100% | 100% | YYYY-Mon( Abb)-DD |
"Day" | 100% | 100% | Levels: "Fri", "Mon", "Sat", "Sun", "Thu", "Tue", "Wed" | 100% |
"Year" | 100% | 100% | 100% listed as "2013" | 100% |
"Received_T" | 100% | 100% | Range: 1 - 235957 | 100% |
"Desc_" | 100% | 100% | Levels: "ALL OTHER OFFENSES", "ASSISTING OR PROMOTING PROSTITUTION", "BAD CHECKS", "CONSPIRE TO COMMIT 1 OF GROUP A OFFENSES", "COUNTERFEITING/FORGERY", "CREDIT CARD/ATM FRAUD", "DESTRUCTION/DAMAGE/VANDALISM", "DISORDERLY CONDUCT", "DRIVING UNDER THE INFLUENCE", "DRUG EQUIPMENT VIOLATIONS", "DRUG/NARCOTIC VIOLATIONS", "DRUNKENNESS" ,"EMBEZZLEMENT", "EXTORTION/BLACKMAIL", "FALSE PRETENSES/SWINDLE", "FAMILY OFFENSES, NOVIOLENT", "FORCIBLE FONDLING (CHILD)", "IMPERSONATION", "INTIMIDATION", "LIQUOR LAW VIOLATIONS", "PEEPING TOM", "PORNOGRAPHY/OBSCENE MATERIAL", "PROSTITUTION", "RUNAWAY", "SIMPLE ASSAULT", "STOLEN PROPERTY OFFENSES", "TRESPASS OF REAL PROPERTY,, "WEAPON LAW VIOLATIONS", "WIRE FRAUD" | 100% |
"Location" | 100% | 100% | Some are listed as address, others are listed as an intersection Matches ARC_street" | 100% |
"coords.x1" | 100% | 52% (2,109) have invalid coordinate system | 52% (2,109) have the same invalid coordinate | 100% |
"coords.x1" | 100% | 52% (2,109) have invalid coordinate system | 52% (2,109) have the same invalid coordinate | 100% |
Notes: Invalid geocodes were not a data transfer issues. Data came with warning that they are mostly geocoded. "There were some errors geocoding and I’m not sure that I can get around to getting a 100% match on all of the addresses. "