Research Method: Data Justice Scenarios

To understand how different datasets can be both beneficial and harmful to Detroit residents, we facilitated focus groups to brainstorm use cases of different data types and then craft scenarios of their use. These scenarios capture the perspectives of community organizers, homeowners, business owners, parents, artists and long-time residents of Detroit. They offer specific ideas for promoting ethical uses of data. Four focus groups were facilitated over two months in the fall of 2016, drawing approximately thirty participants total.

Scenario Creation

To craft each scenario, we introduced a single dataset offered through Detroit’s Open Data Portal, chosen based on two criteria:

  • the dataset is considered one the “most accessed” by users
  • that it has a “Data Lens” view. This is a built-in feature from Socrata, the technology provider of Detroit’s data portal, that provides internal tools for charting, mapping and filtering a dataset. This was useful for effectively introducing datasets to new users within the time constraints of the focus groups.

All exercises involved a mix of silent write and group discussion, and prompted participants to examine the dataset presented and reflect on the following four key questions:

  • Is this data neutral to begin? Is it clear how it is defined, collected or updated?
  • How could these data be used in beneficial ways to you or your communities?
  • How could these data be used in harmful ways to you or your communities?
  • What actions should be taken to maximize benefits and reduce harms? Whose responsibility are those actions (the community, the city, etc)?

We used participant feedback to refine and improve our prompts and facilitation methods between each session.

Findings

Below is a list of the seven datasets covered across the four focus groups. Click on each to read our perceived benefits and harms, as well as any initial questions raised by participants about how that data is collected, maintained or updated by the City.

911 Calls for Service

Initial questions

  • How can we know how accurate reported response times are?
  • Most calls appear to be hang-ups. How are these addressed?

Perceived benefits

  • Predicts where calls/events happen to efficiently allocate vehicles and resources and identify quick routes
  • Helps EMS and hospitals identify recurring patients and adjust their care plan

Perceived harms

  • Gives or reinforces impression of “bad” or “high crime” neighborhoods
  • Increases police presence in places with more calls, increasing profiling and arrests

Blight Violations

Initial questions

  • How does the City define “blight”? Was there a definition before the Duggan administration initiatives?

Perceived benefits

  • Shows “success” of City’s campaign to end blight
  • Supports organizers applying pressure to City to address complaints
  • Illustrates unequal enforcement and neighborhood targeting by ticket writers. What does “blight” look like in different neighborhoods?
  • Reveals discrepancies in where demolitions occur versus where blight is reported as highest through tickets

Perceived harms

  • Justifies divestment of services away from neighborhoods with more blight, like political intention to reallocate Hardest Hit Funds to demolitions rather than foreclosure relief
  • Effects property values and influences redlining practices by insurance companies
  • Concludes “bad” or “undesirable” areas based on old data, subjective definitions and enforcement

Building Permits

Initial questions

  • How soon do permit data get published to the portal from point of application and approval?

Perceived benefits

  • Allows research about locations of future developments and whether they are initiated by existing neighbors or new developers
  • Identifies where businesses are expanding and helps neighbors advocate against ones that have a bad record with the community

Perceived harms

  • Disguises information and creates false narrative. For instance, most permits are for single family homes, but from individual or bulk buyers?
  • Informs structure of mortgages and types of loans available because no clear way to distinguish structures that need foundational changes versus just remodeling

Crime Incidents

Initial questions

  • What about public crime incidents handled by entities outside of DPD, like Wayne State Police or other security forces?

Perceived benefits

  • Exposes details about how different crime ratings are calculated, especially in the case of maps or visuals that focus only on number of crimes, not their type

Perceived harms

  • Increases surveillance and policing in neighborhoods with higher crime levels in the past, justifying practices like “Stop and Frisk” now
  • Heat map creates assumptions of safety based only on quantity of crime, not type or severity. For example, a cluster of noise complaints look more severe than a single incident, even though that single incident may be an assault

Demolitions

Initial questions

  • Which dataset(s) informs which structures get slated for demolition in the first place? Foreclosure data or structure condition data? Are those data available on the portal?

Perceived benefits

  • Helps neighbors research land that is sited for demolition to organize campaigns to prevent destruction, especially of historically significant sites
  • Names vendors and contractors performing demolitions and tracks if and how much money is staying in the City

Perceived harms

  • Data fails to acknowledge structures up for demo that are community assets or historic landmarks, and there is no clear path for disputing a record to prevent demolition
  • Prioritizes certain neighborhoods, and therefore influences the housing market and property values

Improve Detroit Issues

Initial questions

  • Considering the digital divide, how many Detroiters can actually access this app? How many users have downloaded it to date?
  • Can non-Detroit residents report and up-vote issues? Is there a way to differentiate or flag these?

Perceived benefits

  • Saves residents money. For example, filling potholes means less car repairs, fixing water leaks benefits overall water bills, and cleaning up dumping boosts nearby property values
  • Encourages collective action, like once an issue is reported, rally your neighbors to up-vote it to top priority, and track progress of the issue
  • Follows trends in "improvements" to ideally illustrate how services are being delivered more efficiently by departments

Perceived harms

  • Access to the app accelerates divide in who gets city services, where they’re reported most, up-voted, and, therefore, responded to
  • Exaggerates false impressions of neighborhood issues and desirability, especially if non-residents are reporting or up-voting issues
  • Wrongfully penalizes residents. For example, your broken or uninsured car is parked on your street, but it gets reported as abandoned through the app, and is consequently towed and you owe a fine
  • Informs scamming, harassment and stealing; arsonists could find abandoned building locations, dumpers locate existing dump sites to add more to and be less likely to get caught, and metal scrappers locate abandoned vehicles

Parcel Ownership Information

Initial questions

  • We know some egregious property speculators change their name frequently, if and how can it be tracked through the system?

Perceived benefits

  • Addresses and highlights issues of land speculation
  • Shows community groups who is buying up land around them
  • Allows neighbors to find out who owns poor condition structures and advocate for community accountability/for owner to care for their property

Perceived harms

  • Hastens the process of land speculation, especially for those who have technical skills to access and use the data
  • Eases the process of big developers buying up huge tracts of land quickly

Ideas for Maximizing Benefits and Reducing Harm

After these scenario exercises, we prompted participants to re-examine the lists of benefits and harms in order to extract specific actions that might be taken to maximize benefits and reduce harms. The recommendations had some common threads across datasets.

Common actions recommended by participants:

  • Improve language provided directly through the portal, making the data more usable to those with less technical or policy expertise.
    This action falls to the responsibility of the City to provide more thorough documentation, engagement and training.
  • Initiate community-based research projects with the goal of highlighting discrepancies in City service delivery or code enforcement. Conducting research to investigate inconsistencies in open data is an important action to ensure that open data does not reinforce bad reputations or false impressions of certain Detroit neighborhoods, for fear of further justifying divestment and discrimination.
    This action calls on organizers and community-based data practitioners to mobilize around some of these big research questions that can be answered through open data.

Focus Groups

Groups One and Two

Group Three

  • 2 hours
  • Largest group
  • Participants represented a broad network of community organizers invited by members of the DDJC
  • Only a small handful were previously familiar with open data initiatives or the data portal

Group Four

  • Data scenario exercises were facilitated at the Data Discotech hosted at Butzel Community Center.

Next Steps

As we continue working on data justice initiatives through the DDJC and DCTP, we hope to continue facilitating these scenarios to more deeply understand the perceived positive and negative impacts of open data to our communities and eventually tackle some of the research questions that arise from them.