The Telecom Italia Big Data Challenge
Now is Open Data
Tell Me More

What is it?

the reason of this website

The Telecom Italia Big Data Challenge

At the beginning of 2014, Telecom Italia, in collaboration with several international partners, launched the Telecom Italia Big Data Challenge. The contest made available to developers, designers and scientists a large dataset of 30+ kinds of data (mobile, weather, energy, etc.)

... and now ...

We have now opened the data of the contest to everyone, to let anybody understand, study and generate new ideas. Discover more…

Open Data

These datasets are now freely available for anyone to use. We publish it as Open Data because we want people to reuse it!

F.A.Q.

Frequently Asked Questions

What kind of data are we opening up?

These data was used during the Big Data Challenge 2014, an online call for developers, researchers and designers from all over the world to come up with brand-new big data services and applications. The challenge was organized by Telecom Italia, in association with EIT ICT Labs, SpazioDati, MIT Media Lab, Polytechnic University of Milan, Fondazione Bruno Kessler,University of Trento and TrentoRISE.
The data provided in the dataset of the Big Data Challenge is geo-referenced (areas: Milan and the Autonomous Province of Trento – Italy) and anonymized. The dataset contains millions of records of data covering the period from November to December 2013 extracted from telecommunications records, energy, weather, public and private transport, social networks and events.


Under what license is the data available?

The data is released under ODbL license. The Open Database License (ODbL) explicitly covers data and not just creative works like photographs or text. A plain language summary of the ODbL is available on the Open Data Commons website. You are free:

  • to share: to copy, distribute and use the database;
  • to create: to produce works from the database;
  • to adapt: to modify, transform and build upon the database.

The ODbL also requires you to share any improvements you make to this databases under the ODbL as well. For example, if you combine the information with your own data, the resultant information must be published under the Attribution Share-Alike ODbL. This means that the whole community can benefit from both of our work on the database.
The ODbL requires you to attribute your use of this data. This is important as it allows us (and in turn the community) to benefit from reuse of our work, and so allows us to continue to provide this service. Use of any data must be accompanied by a hyperlink reading "from BigDataChallenge contest" and linking to either the ODI node Trento section homepage or the page referring to the information in question.


Why the data is published on dandelion.eu? Do visitors of the site need a login?

SpazioDati is one of the original technical partners of the “Big Data Challenge” contest. Its main role was to provide an affordable way to access to all the data related to the challenge and Dandelion is the original platform where all of this data was published.
It’s not the first time that some large datasets are made available to the public, through a controlled access: we can cite the public data sets published on Amazon S3, for example.
But it’s the first time that there is an official Open Data release starting from some Big Data sets: we know that it’s an hot topic.
Using your account on dandelion.eu to access the data, let us to collect some useful insights on the real demand side of the Open Data value chain.
We’ll publish these statistics of usage as Open Data, to make all the community involved more aware about the data value chain.It’s also useful to give some real perceptions on the Smart Cities and Smart Communities visions.


Why some datasets are published as raw data and others as APIs?

As you can see, the data was supplied in batch mode, using downloadable compressed files, or through API, if this kind of access is meaningful.
API data access allows a specific audience to use data more quickly, easily and efficiently when they are looking to do something specific with the information.


Can I copy this data and publish it on other websites? Can I redistribute it by using torrent, and P2P sharing networks?

Yes, of course. You can do anything you want, as you remain under the terms and conditions of the ODbL license conditions.


Get the Data

Most of the data refers to the period from November 2013 to December 2013 - download: https://dandelion.eu/datamine/open-big-data/

Milano Grid

Some of the datasets referring to the Milan urban area are spatially aggregated using a grid. We refer to this grid as the Milano Grid.

Social Pulse - Milano

This dataset contains data derived from an analysis of geolocalized tweets originated from Milan during the months of November and December.

Trentino Grid

Some of the datasets referring to the Trentino territory are spatially aggregated using a grid. We refer to this grid as the Trentino Grid.

Telecommunications - SMS, Call, Internet - TN

This dataset provides information about the telecommunication activity over the Province of Trento.

Telecommunications - TN to Provinces

This dataset provides information regarding the level of interaction between the Province of Trento and the Italian provinces.

Telecommunications - TN to TN

This dataset provides information regarding the directional interaction strength between the Province of Trento different areas based on the calls exchanged between Telecom Italia Mobile users.

Meteotrentino Weather Station‎ Data

This dataset contains measurements about temperature, precipitation and wind speed/direction taken in 36 Weather Stations.

Precipitation - Trentino

The dataset describes precipitation intensity over the province of Trento.

SET, Electricity

This dataset provides information about the current flowing through the electrical grid of the Trentino province.

TrentoToday

This dataset contains all the articles published on the website trentotoday.it from 01/11/2013 to 31/12/2013.

Telecommunications - SMS, Call, Internet - MI

This dataset provides information about the telecommunication activity over the city of Milano.

Social Pulse - Trentino

This dataset contains data derived from an analysis of geolocalized tweets originated from the Province of Trento during the months of November and December.

Telecommunications - MI to Provinces

This dataset provides information regarding the level of interaction between the areas of the city of Milan and the Italian provinces.

Telecommunications - MI to MI

This dataset provides information regarding the directional interaction strength between the city of Milan different areas based on the calls exchanged between Telecom Italia Mobile users.

Milano Weather Station‎ Data

The dataset describes various meteorological phenomena type and intensity of Milan city using sensors located within the city limits

Precipitation - Milano

The dataset describes precipitation intensity and type over the city of Milan

Air Quality - MI

The dataset describes the pollution type and intensity of Milan city using various types of sensors located within the city limits

MilanoToday

This dataset contains all the articles published on the website milanotoday.it from 01/11/2013 to 31/12/2013.

Administrative Regions

This dataset provides information about the current administrative regions in Europe. Original data sources include ISTAT and Eurostat data.

Partners

the team behind the scenes

Telecom Italia

SKIL Semantics & Knowledge Innovation Lab

SpazioDati

We breathe bigdata, we think in graphs, we speak semantics

Fondazione Bruno Kessler

FBK - ODI node Trento

The Joint Open Lab SKIL of Telecom Italia organizes the contest. SpazioDati is the technological partner hosting the data distribution platform. FBK takes is the scientific partner on big data and open data policy.

The Story

From the contest to the open data

  • January 2014

    The Telecom Italia Big Data Challenge starts

    Telecom Italia announces the Big Data Challenge: the first contest in Italy to stimulate the creation and development of innovative technological ideas from Big Data with the release of one of the largest heterogeneous Big Data set

  • April 2014

    The Big Data Jam

    The contest involved the participation of 1,100+ participants (652 teams and 105 universities) from all over the world. 100+ projects submitted. At the Big Data Jam, hosted in Trento, the three winning teams, one for each track (Data Analytics, Data Visualization and App Development) were announced: Easystats Ltd, the University of Trento and LocaliData, from London, Trento and Milano.

  • December 2014

    The Open Data is out!

    We believe in the power of Open Data and we then decided to release them in Open Data.

  • and now
    it's your
    turn!