...
Leveraging the data potential in Europe from EC
digital dilema: media with higher capacity have lower lifespan
- we're producing now lots of data but storing them on short life-span media
More data bring work on>
- increased BI in public and private sector
- world class applications???
- data policy and data reusing
- multilingual data infrastructure
implementation PSI policy across Europe ensuring compliance
why data matter> - 140Ebln revenue to EU27 in open public data biznis
- better governance and citizen empowerment
- accelerating scientific progress
...
- eurofit platform > dev of new app, single point of access to data, webservices, solid BM isize portal
- Gapfiller web portal for gnss communityfor UNI, for gps developers, simulations
- sopcawind.eu project - SOPCAWIND - SOFTWARE FOR THE OPTIMAL PLACE CALCULATION FOR WIND-FARMS
- Plan4Business.eu - A service platform for aggregation, processing and analysis of urban and regional planning data
- SimpleFleet - Democratizing Fleet Management
- Vista-TV - Video Stream Analytics for Viewers in the TV Industry
- DOPA - Data Supply Chains for Pools, Services and Analytics in Economics and Finance
Keynotes
Jimmy Kevin Pedersen, Agency for Digitization, Ministry of Finance
"New trends in public service and Citizens Communication"
naklady na selfservice su mensie, ako riesenie veci cez email,alebo personal
:
Jasper Hedegaard Bojsen, Technical Director, Microsoft Denmark
"The cloud, Big Data and Great Opportunities"
Windows azure todo???
BING
HADOOP > apache
Simon Riggs, 2ndQuadrant > platinum sponsor, leading contributors, OSB, HQ in eu
"Open Data, Open Database: PostgreSQL"
open data example newteon and darwin > resue of opend data but research didnd t share
Postgre extension
postgis,
PL/R analytic
AXLE> advanced analytics for BIG EDU data
they are looking for partners with big open data, clear use cases, high benefits, November 2012+ simon@2ndquadrant.com,
Florian Bauer, Renewable Energy and Energy Efficiency Partnership (REEEP)
"Using LOD to share clean energy data and knowledge"
reegle.info, link definitions from diff sources > free datasets ready for reuse for every country
LOD> splits responsibilities for datasets, reduces redundancy
* pdf
** excel
*** csv
**** uri
***** LOD linked datasets bring to the context
W3C
Joao Rodrigues Frade, PwC Belgium >xthml % standard,
"Enabling open data interoperability - The case for the Core Business Vocabulary"
joinup platform > vocabularies, standards, taxonomies
ADMS describing metadata,
national register > business core vocabulary
- company type
- company status
- company activity
Ogranization ontology
why use BCV > jasne identifikatory, semantic, aid interoperability, link a legal entiti with its registered address prvide in an inspire conformant way
www.w3c.org/ns/adms#
- BIOpool - sharing of histological images and clinical data, aggregation, indexing and searching
- Fusepool - linmig lab, technology sourcing, ...
- PortDial - speech recording, grammar and ontologies about language, ...
- SmeSpire - establishing infrastructure for geospatial data so that environment policies can work on such data from whole EU
- CODE (Commercially Empowered Linked Open Data Ecosystems in Research) - "we share knowledge to create new knowledge" - 13 TB of research publications + onlotologies + ...
- EUCLID - preparing educational curriculum for usage of LinkedData
Keynotes
Andreas Both: From data-driven start-up to large company in decade
success story:
- being capable of integrating many datasets
- user focused data
having say 10x more data can mean 1000x longer analysis time
Hadoop: talent gap
Jimmy Kevin Pedersen
Agency for Digitization, Ministry of Finance
"New trends in public service and Citizens Communication"
naklady na selfservice su mensie, ako riesenie veci cez email,alebo personal
:
Jasper Hedegaard Bojsen
Technical Director, Microsoft Denmark
"The cloud, Big Data and Great Opportunities"
Windows azure todo???
BING
HADOOP > apache
Simon Riggs
2ndQuadrant > platinum sponsor, leading contributors, OSB, HQ in eu
"Open Data, Open Database: PostgreSQL"
open data example newteon and darwin > resue of opend data but research didnd t share
Postgre extension
postgis,
PL/R analytic
AXLE> advanced analytics for BIG EDU data
they are looking for partners with big open data, clear use cases, high benefits, November 2012+ simon@2ndquadrant.com,
Florian Bauer
Renewable Energy and Energy Efficiency Partnership (REEEP)
"Using LOD to share clean energy data and knowledge"
reegle.info, link definitions from diff sources > free datasets ready for reuse for every country
LOD> splits responsibilities for datasets, reduces redundancy
* pdf
** excel
*** csv
**** uri
***** LOD linked datasets bring to the context
W3C
Joao Rodrigues Frade
PwC Belgium >xthml % standard,
"Enabling open data interoperability - The case for the Core Business Vocabulary"
joinup platform > vocabularies, standards, taxonomies
ADMS describing metadata,
national register > business core vocabulary
- company type
- company status
- company activity
Ogranization ontology
why use BCV > jasne identifikatory, semantic, aid interoperability, link a legal entiti with its registered address prvide in an inspire conformant way
www.w3c.org/ns/adms#
www.w3.org/2012/Talks/0606_phila_edf
...
Centrum Wiskunde & Informatica (CWI)
In this talk, we present SRBench, the first benchmark for Streaming RDF Storage Engines, which is completely based on real-world datasets. With the increasing problem of too much streaming data but not enough tools to gain and even derive knowledge from those data, researchers have set out for solutions in which Semantic Web technologies are adapted and extended for the publishing, sharing, analysing and understanding of such data. Various approaches are emerging, , e.g., C-SPARQL, SPARQLStream, StreamSPARQL and CQELS. To help researchers and users to compare streaming RDF engines in a standardised application scenario, we propose SRBench, with which one can assess the abilities of a streaming RDF engine to cope with a broad range of use cases typically encountered in real-world scenarios. The design of SRBench is based on an extensive study of the state-of-the-art techniques in both the data stream management systems and the streaming RDF processing engines, and the existing RDF/SPARQL benchmarks. This ensures that we capture all important aspects of streaming RDF processing in the benchmark.
The first goal of SRBench is to evaluate the functional completeness of a streaming RDF engine. The benchmark contains a concise, yet comprehensive set of queries which covers the major aspects of streaming SPARQL query processing, ranging from simple pattern matching queries to queries with complex reasoning tasks. The main advantages of applying Semantic Web technologies on streaming data include providing better search facilities by adding semantics to the data, reasoning through ontologies, and integration with other data sets. The ability of a streaming RDF engine to process these distinctive features is accessed by the benchmark with queries that apply reasoning not only over the streaming sensor data, but also over the metadata and even other data sets in the Linked Open Data (LOD) cloud.
To give a first baseline and illustrate the state of the art, we show results obtained from implementing SRBench using the Polit cnica de Madrid (UPM). The engine supports the streaming RDF query language, also called SPARQLStream. The evaluation shows that the functionality supported by SPARQLStream is fairly complete. At the language level, it is able to express all benchmark queries easily and concisely. At the query processing level, some missing features have been discovered, for all of which preliminary code has been added for further development.
Irini Fundulaki: Abstract Access Control Model for Dynamic RDF Datasets
do not store access value with each triplet: it's costly and hard to update
use tokens and store formulas use to compute value for the token
Big Big Data Public Private Forum (BIG) initiative
...
Networking for European ICT Research & Development
Notes from Hackaton
Projects offering help and showcasing at the Hackaton:
- PlanetData: GADM-RDF, cumulus rdf, SEC Edgar Linked Data Wrapper, geometry2rdf, morph, map4rdf
- Open Bank Project: raising bar for financial transparency
- LOD2
- IKS: semanticaly enabled CMS
- LMF (Linked Media Framework)
What we've participated in, tried, ...:
- LMF showcase: what an LMF developer can do with some data from Slovakia
- creation of ontology about power distribution in Bulgaria: how is power derived from The People to pariament, government, ministries, ...
- geting to know better LOD: Virtuoso, sizing, scalability, caching, ...
- VIE.js - Semantic Interaction Framework
...
Some take-away material from the conference:
...