Codebook created
- No data dictionary was provided
Each variable profiled for quality (completeness, validity, consistency, and uniqueness). This is documented in the codebook.
Overall Data Description:
- Year: 2013
- Coverage: Arlington
- Unit of observation: Incident
- No Unique Identifier
Profiling summary
The tables below contain the results of the data profiling of key variables. To see more details and profiling for all variables, see codebook.
| Quality: Part 1 | ||||
|---|---|---|---|---|
| N | 4,083 | |||
| Duplications | No duplications | |||
| Variables | Completeness | Validity | Uniqueness | Consistency |
"Status | 100% | 100% | Levels: "M" "T" "U" | 100% |
"Score" | 100% | 100% | Levels: "0", "90.97", "100" | 100% |
| "Match_type" | 100% | 100% | 100% listed as "A" | 100%100% |
| "Match_addr | 683 missing | 100% | 100% | |
| "Side" | 683 missing | 100% | Levels: "L" "R" | 100% |
| "User_fld" | 683 missing | 100% | 100% listed as "0" | 100% |
| "Addr_type" | 683 missing | 100% | 100% listed as "Address" | 100% |
| "ARC_street" | 100% | 100% | Some are listed as address, others are listed as an intersection "Addresses are rounded to the nearest 100 block" | 100% |
| "ID" | 100% | 100% | 100% Unique (sequence 1:n) | 100% |
| "Date" | 100% | 100% | 100% | YYYY-MM-DD |
| "Day" | 100% | 100% | Levels: "Fri", "Mon", "Sat", "Sun", "Thu", "Tue", "Wed" | 100% |
| "Year" | 100% | 100% | 100% listed as "2013" | 100% |
| "Reported_T" | 100% | 100% | Range: 1 - 2400 | 100% |
| "Desc_" | 100% | 100% | Levels: "AGGRAVATED ASSAULT", "ALL OTHER LARCENY", "BURGLARY/BREAKING AND ENTERING", "FORCIBLE RAPE", "FROM COIN-OPERATED MACHINE OR DEVICE", "MOTOR VEHICLE THEFT", "POCKET-PICKING", "PURSE-SNATCHING" , "ROBBERY", "SHOPLIFTING", "THEFT FROM BUILDING", "THEFT FROM MOTOR VEHICLE", "THEFT OF MOTOR VEHICLE PARTS OR ACCESSORIE" | 100% |
| "Location" | 100% | 100% | Some are listed as address, others are listed as an intersection Matches ARC_street" | 100% |
| "coords.x1" | 100% | 17% (683) have invalid coordinate system | 17% (683) have the same invalid coordinate | 100% |
| "coords.x1" | 100% | 17% (683) have invalid coordinate system | 17% (683) have the same invalid coordinate | 100% |
| Quality: Part 2 | ||||
|---|---|---|---|---|
| N | 7,492 | |||
| Duplications | No duplications | |||
| Variables | Completeness | Validity | Uniqueness | Consistency |
"Status | 100% | 100% | Levels: "M" "T" "U" | 100% |
"Score" | 100% | 100% | Levels: "0", "90.97", "100" | 100% |
| "Match_type" | 100% | 100% | 100% listed as "A" | 100%100% |
| "Match_addr | 2,105 missing | 100% | 100% | |
| "Side" | 2,109 missing | 100% | Levels: "L" "R" | 100% |
| "User_fld" | 2,109 missing | 100% | 100% listed as "0" | 100% |
| "Addr_type" | 2,105 missing | 100% | 100% listed as "Address" | 100% |
| "ARC_street" | 100% | 100% | Some are listed as address, others are listed as an intersection "Addresses are rounded to the nearest 100 block" | 100% |
| "ID" | 100% | 100% | 100% Unique (sequence 1:n) | 100% |
| "Reported_D" | 100% | 100% | 100% | YYYY-Mon( Abb)-DD |
| "Day" | 100% | 100% | Levels: "Fri", "Mon", "Sat", "Sun", "Thu", "Tue", "Wed" | 100% |
| "Year" | 100% | 100% | 100% listed as "2013" | 100% |
| "Received_T" | 100% | 100% | Range: 1 - 235957 | 100% |
| "Desc_" | 100% | 100% | Levels: "ALL OTHER OFFENSES", "ASSISTING OR PROMOTING PROSTITUTION", "BAD CHECKS", "CONSPIRE TO COMMIT 1 OF GROUP A OFFENSES", "COUNTERFEITING/FORGERY", "CREDIT CARD/ATM FRAUD", "DESTRUCTION/DAMAGE/VANDALISM", "DISORDERLY CONDUCT", "DRIVING UNDER THE INFLUENCE", "DRUG EQUIPMENT VIOLATIONS", "DRUG/NARCOTIC VIOLATIONS", "DRUNKENNESS" ,"EMBEZZLEMENT", "EXTORTION/BLACKMAIL", "FALSE PRETENSES/SWINDLE", "FAMILY OFFENSES, NOVIOLENT", "FORCIBLE FONDLING (CHILD)", "IMPERSONATION", "INTIMIDATION", "LIQUOR LAW VIOLATIONS", "PEEPING TOM", "PORNOGRAPHY/OBSCENE MATERIAL", "PROSTITUTION", "RUNAWAY", "SIMPLE ASSAULT", "STOLEN PROPERTY OFFENSES", "TRESPASS OF REAL PROPERTY,, "WEAPON LAW VIOLATIONS", "WIRE FRAUD" | 100% |
| "Location" | 100% | 100% | Some are listed as address, others are listed as an intersection Matches ARC_street" | 100% |
| "coords.x1" | 100% | 52% (2,109) have invalid coordinate system | 52% (2,109) have the same invalid coordinate | 100% |
| "coords.x1" | 100% | 52% (2,109) have invalid coordinate system | 52% (2,109) have the same invalid coordinate | 100% |
Notes: Invalid geocodes were not a data transfer issues. Data came with warning that they are mostly geocoded. "There were some errors geocoding and I’m not sure that I can get around to getting a 100% match on all of the addresses. "