Using data to better understand and improve cities is no longer a revolutionary new idea. Cities across the world now release data publicly and use data internally to drive better services, social scientists increasingly use “big” data and computation to study urban environments, and civic hacking groups create data-driven websites and apps to inform and benefit communities. But in many ways, these are just the low-hanging fruit of urban data science, which remains a young field with more promise than results.

The first University of Chicago Convening on Urban Data Science, organized by the CI’s Urban Center for Computation and Data (UrbanCCD) and the UChicago Urban and sponsored by the Harris School for Public Policy, reflected this early stage of growth. There was ample enthusiasm about early projects using analytics, sensing, mapping, and both public and private data sources. There was excitement about the future and new collaborations formed across countries, disciplines, and public/private/academic spheres. But there were also passionate discussions about the ethical and moral implications of urban data science, an important reflection for a fast-growing field with the goal of improving policy and people’s lives.

Many of those topics were addressed at the very start by Julia Lane, Professor at the NYU Center for Urban Science and Progress (CUSP), who focused her keynote speech on the rapid maturation of urban data science. Her stance was that the social science of cities, as it starts to grapple with massive datasets and major instrumentation projects, must look to preceding “Big Science” projects in physics, genomics, astronomy, and other fields for guidance.

“In common with other areas of science, the scale is changing,” Lane said. “It’s not just that we can measure these things; we have to stop and think, what are we measuring? What are the research questions? That’s where the science comes in.”

Among her recommendations were to build a community of scientists, avoid building closed-off “data dungeons,” and to identify key priority areas to focus on first. She also proposed that city/university partnerships would be the ideal fundamental node of urban science activity, creating a new system where local data collected on the city level flowed up to the federal level, instead of the current model where federal agencies such as the U.S. Census Bureau collect and release most data on American cities.

Luckily, those themes were already well represented in the Convening’s agenda. A panel on partnerships between academia and policymakers included university groups such as Urban Labs, which is working with Los Angeles County on homelessness and poverty, and City of Chicago Chief Data Officer Tom Schenk, who leads predictive analytics efforts for city health inspections and the launch of Chicago’s OpenGrid platform. Schenk said that open data and open source technology have opened up new partnership opportunities, such as a community effort to better predict beach e. Coli levels this summer, and allowed Chicago to share its data products with other cities.

The conference’s second day began with a focus on ethics, based around a presentation by Peter Elias from the University of Warwick and Hallvard Fossheim of the University of Bergen. The two speakers were part of the team behind an upcoming Organization for Economic Cooperation and Development report on research ethics in urban data science, and their summary of the task force’s recommendations sparked a spirited conversation.

The foremost topic was informed consent in an age of increased data collection, where the public may not always be aware of how their data will be used by governments and researchers. Where the report recommended the use of Ethics Review Boards similar to those seen in medical research, a response panel including Lane, Nicole Marwell from UChicago, and NSF's Peter Muhlberger argued instead for a Code of Ethics to create research norms for urban science, as well as a commitment to only pursue research where potential benefits exceed potential risk.

An example of that balancing act was provided by Pete Edwards of the University of Aberdeen, who described a project in rural Scotland that used GPS locations from bus passengers’ cell phones to estimate bus arrival time for other riders. Even though the passengers’ personal data was not shared, because of the low population in the area, it was not difficult to determine who each passenger probably was based on where they got on and off the bus. But when the researchers expressed their privacy concerns to the community, the members said it was worth the access to bus arrival information -- even believing bus service had improved despite no changes to arrival frequency.

Other sessions were more methods-focused, examining how different public data sources and techniques such as GIS mapping could help answer important questions about cities. For instance, Jennifer Doleac from University of Virginia presented a study on using data from ShotSpotter technology, which uses audio sensors to detect and locate gunfire, to assess whether juvenile curfews affect street crime. A project called Micro-Array of Things, run by University of Chicago Medicine’s Stacy Tessler Lindau, Douglas Pancoast from the School of the Art Institute of Chicago, and Paloma Gonzalez Rojas of MIT, will use cheap sensors and XBox Kinects to observe how families use an open free food pantry in the children’s hospital.

Another project centered at University of Chicago Medicine would like to take an even more ambitious approach to using data to study the interaction between environment, experience, and personal health. A team including CI fellows Marc Berman and Samuel Volchenboum and UrbanCCD’s Will Engler and Maggie King is starting to look at ways to connect clinical data to data on environmental and social factors, from air quality and the number of trees in your neighborhood to transportation access and crime. The ultimate goal would be to create a more complete picture of a person’s health and environment, so that physicians can treat patients more effectively and address non-medical issues as well.

The project was just the kind of initiative that the Convening was meant to inspire, one that crosses traditional disciplinary lines and builds new knowledge and research out of existing or imminent data and technology. A testament to the conference’s success on that point came in the “closing remarks,” when organizer and UrbanCCD director Charlie Catlett declined to interrupt the many productive conversations happening around the room and said simply, “We hope you walk away with new collaborators” -- fulfilling urban data science’s promise, one new relationship at a time.