Research Projects

The Center for Collective Learning is a multidisciplinary research laboratory focused on understanding the growth, accumulation, and value of socially embedded knowledge. For decades, our team has contributed to understanding patterns of economic diversification, growth, and development. We also have new areas of research focused on digital democracy and people’s perception of technology.

Economic Complexity

EZl-YU5XkAAxUfG.png

For decades, economics has relied on the use of statistics as its battle-horse for empirical analysis. Today, machine learning tools, from recommender systems to dimensionality reduction techniques, provide a new way to understand economic development.

During the last decade, the field of economic complexity emerged as an important quantitative tool for the understanding of economic geography, development, and innovation. Indicators, such as the Economic Complexity Index (ECI), have become popular statistics of the economic capacities of locations, and concepts such as relatedness, have helped improve our understanding of the diversification paths undertaken by national and regional economies.

The Center for Collective Learning works on the development and application of machine learning techniques applied to economic development. Building on our vast experience on the fields of economic complexity, economic development, and economic geography, we provide a new predictive lens to understand the evolution of economies, from cities to nations.

The Observatory of Economic Complexity is a tool that allows users to quickly compose a visual narrative about countries and the products they exchange. It was Alexander Simoes' Master Thesis in Media Arts and Sciences at the MIT Media Lab, which you can read here. The project was conducted at The MIT Media Lab Macro Connections group. Alex’s Advisor was César A. Hidalgo, principal investigator of Macro Connections. Since its creation in 2010, the development of The Observatory of Economic Complexity has been supported by The MIT Media Lab consortia for undirected research. For a history of the contributions to The Observatory of Economic Complexity, you can view the project’s contributions timeline on Github. A predecessor of the Observatory of Economic Complexity is the product space site, built by César Hidalgo as a graduate student at Notre Dame in 2007. If you would like more info on the OEC or to create a similar site for your country, state, or city, get in touch with us at oec@media.mit.edu.

The Observatory of Economic Complexity (OEC) (2011) makes more than fifty years of international trade data available through dozens of millions of interactive visualizations. It is the world’s most popular site to visualize international trade data.

The OEC was developed by Alexander Simoes as part of his requirement to complete his Masters in Media, Arts, and Sciences.

DataViva (2013) made available data for the entire economy of Brazil, including exports and imports for each municipality and product, and occupation data for every municipality, industry, and occupation. 

DataViva was created in a collaboration between FapeMIG, DataWheel, and the Macro Connections group at the MIT Media Lab.

WEBSITE www.datausa.io Bringing Government Data to Life The U.S. government offers almost 200,000 data sets for public use but they’re scattered, daunting and - most importantly - out of reach for the average citizen. DataUSA presents public data transparently and smoothly. Enter a new breed of open government data portal. To date, it is the largest, most comprehensive website for U.S. government data. DataUSA aggregates and visualizes data from multiple sources using a new and powerful visualization engine. Spanning a wide range of topics — labor and job markets, higher education, regional demographics, health care, and transportation — DataUSA is designed to enhance understanding, reveal patterns, and inform decision making. It provides a free, easy-to-use data platform that allows everyone from business executives and government policymakers to students and academic researchers to do their own data analyses, create content, embed charts, and find answers. DataUSA provides tools to transform data into stories about America — its people, places, industries, occupations, skill sets, and educational institutions.

DataUSA (2016) is a leading data visualization and integration engine for US public data. It integrates and distributes data from a variety of public sources. DataUSA was created in a collaboration between Datawheel, Deloitte, and the Collective Learning Group at MIT.

DataChile is a platform that integrates, distributes, and visualizes data from dozens of chilean sources.

DataChile (2018) integrates, distributes, and visualizes data from more than a dozen different official public sources. DataChile was constructed by Datawheel with the support of Antofagasta Minerals, Corfo, Chile's Ministry of Economy, and Entel.



Digital Democracy

Can digital technologies augment democracy?

At the Center for Collective Learning we design, test, and develop tools to support civic participation. Our team consists of talented engineers, lawyers, scientists, and designers, experienced in the design, testing, and development of digital tools.

Our contributions include the creation of collaborative government program generators, which we have tested experimentally in France and Brazil, and modular constitution builders, such as Constitutin, which allow citizens to create personalized and collective constitutions.

Screen Shot 2020-12-31 at 12.11.40 PM.png

We are interested in exploring new user interfaces and paradigms of participation, that help augment deliberation and collaboration among politically distant groups. We are also interested in participation paradigms that can increase the fluidity of the boundaries of participation. This includes non-binding digital forms of participation for non-citizen residents (from international scholars to refugees).

Our work builds on our vast experience in the design and implementation of complex online platforms. This expertise allows us to take an active approach to research on digital democracy, where we can learn through the design, implementation, and deployment, of digital platforms. Using on the ground, and controlled experiments, we are helping develop a solid understanding of how digital tools can augment and support the ideals of democracy.

Augmented Democracy (2018) is the idea of augmenting the participatory and decision making capacities of people by providing them with a digital twin. Here, you can get a gist of the idea on augmented democracy.

Participie (2012) was a design experiment on direct participation with constrained choices (like budgets). Participie was developed by Ali Almossawi as part of an independent study class.

Can we use Google Street View to Measure Inequality? This is the video summary of the paper The Collaborative Image of The City: Mapping the Inequality of Urban Perception. Salesses P, Schechtner K, Hidalgo CA (2013) The Collaborative Image of The City: Mapping the Inequality of Urban Perception. PLoS ONE 8(7): e68400. doi:10.1371/journal.pone.0068400

Place Pulse (2011) is a crowdsourcing effort to map urban perception. By asking users to select images from a pair, Place Pulse collects the data needed to evaluate people's perceptions of urban environments. This data is also the data used to train Streetscore.

Place Pulse was developed by Phil Salesses as part of his requirement to complete his master thesis. The present version of Place Pulse was re-engineered by Daniel Smilkov and Deepak Jagdish.

Networks and visualization

How does your social network changes overtime? What makes a language truly global? What shapes our collective memory as a species? Many of our previous research focused on such questions, but it all came down to one common phrase: connections.

Pantheon is a project by the MIT Media Lab Macro Connection's group that makes available data on historical cultural production through thousands of interactive visualizations. Access at pantheon.media.mit.edu.

Pantheon (2013) is an effort to map our species collective memory by structuring data on the biographies of globally famous indivdiuals. Pantheon 1.0 was developed by Amy Yu, in collaboration with Kevin Hu and Shahar Ronen. 

Immersion (2013) The current interface of emails is designed around time, and messages, pushing people to focus on what is more recent rather than important. Immersion is a design experiment that centers the email interface on people and the networks that people form. 

Immersion was developed by Daniel Smilkov and Deepak Jagdish as part of their requirement for a Masters in Media, Arts, and Sciences.

Global Language Network (2013) is a project that uses the structure of the networks connecting multilingual speakers and translated texts, as expressed in book translations, multiple language editions of Wikipedia, and Twitter, to provide a concept of language importance that goes beyond simple economic or demographic measures.

Global Language Network (paper) was written by Shahar Ronen, Bruno Gonçalves, Kevin Z. Hu, Alessandro Vespignani, Steven Pinker, and César A. Hidalgo.

Shout! (2016) Can I borrow your network? Shout! is a marketplace for retweets that allows people to exchange micro-contracts for future retweets. Shout! facilitates the coordination of social media diffusion efforts by groups.

Shout! was created together with Ambika Krishnamachar as part of her requirement for a Master's of Engineering.

Streetscore (2014) is a computer vision algorithm that estimates people's perception of urban environments. We have used Streetscore to create high resolution maps of urban perception with the goal of studying the social impact of urban perception, and also, to study urban change.

Streetscore was created together with Nikhil Naik. The Streetscore website was created together with Nikhil Naik and Jade Philipoom.

DIVE is a data visualization and analysis tool that integrates multiple steps of the data analysis pipeline created by Kevin Hu at MIT's Collective Learning group under the supervision of professor Cesar A. Hidalgo.

DIVE (2017) is a data integration and visualization engine that allows users to transform data into stories, by facilitating visualizations through recommendations, and providing an statistical tool including multivariate statistics. DIVE was created by Kevin Hu, as part of his Masters and PhD work at the Collective Learning group at the MIT Media Lab (paper)

OpenTeams is an open source platform to visualize and analyze data from teams. It uses data from email, as well as surveys of a team's demographics, diversity, and psychology. OpenTeams is available at openteam.info

OpenTeams (2018) is an open source platform to visualize team data. It is designed for email metadata, and also, includes validated surveys regarding personality (big five) and morals (moral foundation). You can access OpenTeams.