At the beginning of 2014, Telecom Italia, in collaboration with several international partners, launched the Telecom Italia Big Data Challenge. The contest made available to developers, designers and scientists a large dataset of 30+ kinds of data (mobile, weather, energy, etc.)
We have now opened the data of the contest to everyone, to let anybody understand, study and generate new ideas. Discover more…
These datasets are now freely available for anyone to use. We publish it as Open Data because we want people to reuse it!
These data was used during the Big Data Challenge 2014, an online call for developers, researchers and designers from all over the world to come up with brand-new big data services and applications. The challenge was organized by Telecom Italia, in association with EIT ICT Labs, SpazioDati, MIT Media Lab, Polytechnic University of Milan, Fondazione Bruno Kessler,University of Trento and TrentoRISE.
The data provided in the dataset of the Big Data Challenge is geo-referenced (areas: Milan and the Autonomous Province of Trento – Italy) and anonymized. The dataset contains millions of records of data covering the period from November to December 2013 extracted from telecommunications records, energy, weather, public and private transport, social networks and events.
The data is released under ODbL license. The Open Database License (ODbL) explicitly covers data and not just creative works like photographs or text. A plain language summary of the ODbL is available on the Open Data Commons website. You are free:
The ODbL also requires you to share any improvements you make to this databases under the ODbL as well. For example, if you combine the information with your own data, the resultant information must be published under the Attribution Share-Alike ODbL. This means that the whole community can benefit from both of our work on the database.
The ODbL requires you to attribute your use of this data. This is important as it allows us (and in turn the community) to benefit from reuse of our work, and so allows us to continue to provide this service. Use of any data must be accompanied by a hyperlink reading "from BigDataChallenge contest" and linking to either the ODI node Trento section homepage or the page referring to the information in question.
SpazioDati is one of the original technical partners of the “Big Data Challenge” contest. Its main role was to provide an affordable way to access to all the data related to the challenge and Dandelion is the original platform where all of this data was published.
It’s not the first time that some large datasets are made available to the public, through a controlled access: we can cite the public data sets published on Amazon S3, for example.
But it’s the first time that there is an official Open Data release starting from some Big Data sets: we know that it’s an hot topic.
Using your account on dandelion.eu to access the data, let us to collect some useful insights on the real demand side of the Open Data value chain.
We’ll publish these statistics of usage as Open Data, to make all the community involved more aware about the data value chain.It’s also useful to give some real perceptions on the Smart Cities and Smart Communities visions.
As you can see, the data was supplied in batch mode, using downloadable compressed files, or through API, if this kind of access is meaningful.
API data access allows a specific audience to use data more quickly, easily and efficiently when they are looking to do something specific with the information.
Yes, of course. You can do anything you want, as you remain under the terms and conditions of the ODbL license conditions.
Some of the datasets referring to the Milan urban area are spatially aggregated using a grid. We refer to this grid as the Milano Grid.
This dataset contains data derived from an analysis of geolocalized tweets originated from Milan during the months of November and December.
Some of the datasets referring to the Trentino territory are spatially aggregated using a grid. We refer to this grid as the Trentino Grid.
This dataset provides information about the telecommunication activity over the Province of Trento.
This dataset provides information regarding the level of interaction between the Province of Trento and the Italian provinces.
This dataset provides information regarding the directional interaction strength between the Province of Trento different areas based on the calls exchanged between Telecom Italia Mobile users.
This dataset contains measurements about temperature, precipitation and wind speed/direction taken in 36 Weather Stations.
This dataset provides information about the current flowing through the electrical grid of the Trentino province.
This dataset contains all the articles published on the website trentotoday.it from 01/11/2013 to 31/12/2013.
This dataset provides information about the telecommunication activity over the city of Milano.
This dataset contains data derived from an analysis of geolocalized tweets originated from the Province of Trento during the months of November and December.
This dataset provides information regarding the level of interaction between the areas of the city of Milan and the Italian provinces.
This dataset provides information regarding the directional interaction strength between the city of Milan different areas based on the calls exchanged between Telecom Italia Mobile users.
The dataset describes various meteorological phenomena type and intensity of Milan city using sensors located within the city limits
The dataset describes precipitation intensity and type over the city of Milan
The dataset describes the pollution type and intensity of Milan city using various types of sensors located within the city limits
SKIL Semantics & Knowledge Innovation Lab
We breathe bigdata, we think in graphs, we speak semantics
FBK - ODI node Trento
The Joint Open Lab SKIL of Telecom Italia organizes the contest. SpazioDati is the technological partner hosting the data distribution platform. FBK takes is the scientific partner on big data and open data policy.
Telecom Italia announces the Big Data Challenge: the first contest in Italy to stimulate the creation and development of innovative technological ideas from Big Data with the release of one of the largest heterogeneous Big Data set
The contest involved the participation of 1,100+ participants (652 teams and 105 universities) from all over the world. 100+ projects submitted. At the Big Data Jam, hosted in Trento, the three winning teams, one for each track (Data Analytics, Data Visualization and App Development) were announced: Easystats Ltd, the University of Trento and LocaliData, from London, Trento and Milano.
We believe in the power of Open Data and we then decided to release them in Open Data.