Jeremy S. Wu, Ph.D.

胡善庆博士

2014 Workshop on Big Data and Urban Informatics

After more than a year of preparation, the Workshop on Big Data and Urban Informatics was held at the University of Illinois at Chicago on August 11-12, 2014.

More than 150 people from at least 10 countries (Australia, Canada, China, Greece, Israel, Italy, Japan, Portugal, the United Kingdom, and the U.S.) attended the forum, which was sponsored by the National Science Foundation.

Piyushimita (Vonu) Thakuriah, co-chair of the workshop, reported on the funding of the Urban Big Data Center at the University of Glasgow in Scotland (http://bit.ly/1kXG2Uh).  Its mission is to “support research for improved understanding of urban challenges and to provide data, technology and services to manage, make policy, and innovate in cities.”  The Urban Big Data Center partners with five other universities, including the University of Illinois at Chicago. Vonu, a transportation expert, is the director of the center.

Over two full days, a total of 68 excellent presentations were made, far exceeding what the organizers had expected a year earlier.  These papers will be posted on the web in the near future.

Two luncheon keynote speakers highlighted the workshop.  

Carlo Ratti presented the state-of-the-art work of the MIT SENSEable City Lab, which specializes in the deployment of sensors and hand-held electronics to study the environment.  Since conventional measures of air quality tend to be collected at stationary locations, they do not always represent the exposure of a mobile individual.  In one project titled “One Country, Two Lungs” (http://bit.ly/1nbSBXi), a team of human probes travelled between Shenzhen and Hong Kong to detect urban air pollution.  The video revealed the divisions in atmospheric quality and individual exposure between these two cities. 

Paul Waddell of the University of California at Berkeley presented his work on urban simulation and dynamic 3-D visualization of land use and transportation.  Some impressive images of his work can be found at http://bit.ly/1rn9hmj.  His video and examples reminded me of their potential applicability to creating the “Three Districts and Four Lines” in China’s National Urbanization Plan.  I also learned about a somewhat similar set of products from China’s supermap.com, a Geographic Information System software company based in Beijing.

One of the 68 presentations described the use of smart card data to study commuting patterns and volume in the Beijing subway during rush hours.  Another presentation compared the characteristics of big data and statistics and raised the question of whether big data is a supplement to or a substitute for statistics.

The issue of data quality was seldom volunteered in the sessions, but questions about it came up frequently.  Whether it was called editing, filtering, cleaning, scrubbing, imputing, curating, re-structuring, or something else, it was clear that some presenters spent an enormous amount of time and effort just to get the data ready for very basic use.

Perhaps data quality is considered secondary in exploratory work.  However, there are good-quality big data and bad-quality big data.  When other options are available, spending too much time and effort on bad-quality big data seems unwise because the work is unlikely to serve a practical purpose in the future.

There were also few presentations that discussed the importance of data structure, whether it is built in by design or created afterwards through metadata.  Structured data contain far more potential information than unstructured data and tend to allow more efficient information extraction, especially if they can be linked across multiple sources.
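
As a small illustration of the linking point, here is a minimal sketch in Python. It assumes two hypothetical record sets (building permits and transit access) that happen to share a `parcel_id` field; the field names, values, and the `link_by_key` helper are invented for this example and do not come from any workshop paper.

```python
from collections import defaultdict

# Hypothetical records from two sources that share a key (parcel_id).
permits = [
    {"parcel_id": "P001", "permit_type": "renovation"},
    {"parcel_id": "P002", "permit_type": "new construction"},
]
transit = [
    {"parcel_id": "P001", "nearest_stop_m": 120},
    {"parcel_id": "P003", "nearest_stop_m": 640},
]


def link_by_key(left, right, key):
    """Inner-join two lists of dicts on a shared key field."""
    # Index the right-hand records by key so each lookup is cheap.
    index = defaultdict(list)
    for row in right:
        index[row[key]].append(row)

    linked = []
    for row in left:
        # Only records whose key appears in both sources are kept.
        for match in index.get(row[key], []):
            linked.append({**row, **match})
    return linked


linked = link_by_key(permits, transit, "parcel_id")
print(linked)
```

Only the parcel present in both sources is linked.  Without a shared, well-defined key, the same join would require fuzzy matching on addresses or names, which is where much of the cleaning effort mentioned above tends to go.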

For the purpose of governance, I was somewhat surprised that the use of administrative records had not yet caught on at this workshop.  Accessibility and confidentiality appeared to be the barriers.  It would seem helpful for future workshops to include city administrators and public officials to help bridge the gap between research and the practical needs of day-to-day operations.

Nations and cities share a common goal in urban planning and urban informatics: to improve the quality of city life and the delivery of services to constituents and businesses alike.  On the other hand, there are drastic differences in their current standing and approach.

China is experiencing the largest human migration in history.  It has established goals and direction for urban development, but has little reliable, quantitative research or experience to support and execute its plans.  The West is transitioning from its century-old urban living to a future that is filled with exciting creativity and energy, but does not seem to have as clear a vision or direction.

Confidentiality is an issue on which China and the West contrast sharply.  The Chinese plans show a strong commitment to collecting and merging linkable individual records extensively.  If implemented successfully, they will generate an unprecedented amount of detailed information that can also be abused and misused.  The same approach would likely face much scrutiny and opposition in the West, which has to consider less reliable but more costly alternatives in order to meet the same needs.

There is perhaps no absolutely right or wrong approach to these issues.  The workshop and the international community being created around it offer a valuable opportunity to observe, discuss, and compare approaches on many topics of common global interest.

Selected papers from the workshop will now undergo additional peer review.  They will be published in an edited volume titled “See Cities Through Big Data – Research, Methods and Applications in Urban Informatics.”

Confidentiality Data Quality Environment Transportation Urban Informatics
August 23, 2014 Jeremy
