Category Archives: Announcement

Why to manage and share research data?

Tomi Kauppinen, a blog post for his keynote on “How to manage and share Spatiotemporal Research Data?  Supporting learning and reproducibility online via Linked Open Science.” at the The 3rd LEARN workshop on Research Data Management“Make research data management policies work”, organized by the EU-funded project, LEARN (Leaders Activating Research Networks), Helsinki, June 28th, 2015. 

presentation-headline

Why to manage and share research data?

With open data taking on and also open access (to publications), the big question remains: where is open science? I argue that for open science to really fly we need both the

  1. open research data = data used or produced by scientific efforts
  2. open accessible methods = methods in publications made reproducible

But how to do this? By whom, where and when? Essentially, first we need to answer the “why” questions – i.e. figuring out the excellent incentives – and then the other important questions (who, what, where, when, how) will naturally follow.

The “why” question calls us to think about the

  1. Incentives for a researcher to open their data. Thus: why would a researcher open his/her data for others? Is it enough that many journals (e.g. PLOS One, see our article as an example) now require data to be available?
  2. Incentives for funders and research managers to request opening of research data. Thus: why would decision-makers ask for the open data?
  3. Incentives for the society to ask for open data. Thus: why is it useful to have open research data?

Learning as the key term to answer the why questions

If we look at these why-questions there is an interesting answer that covers all of them. The answer to create incentives for opening research data and enabling reproducibility includes a key term, that is,  learning.

Interestingly, learning largely happens via reproducing existing efforts (just think about all the text books and their numerous examples with enumerated steps for reproducing success).

Thus if we manage to reach the learning layer, the reproducibility will follow.

Now let us get back to our “why” questions, and start from the “why” question number 3: what if we agree that the society at large wants to learn about what science produces (like  educating citizens to be well-informed about the world, educating students to be masters in their fields or educating companies to develop new systems and explore growth options)?

The society calls for better ways to to support learning, and preferably online as we are now living in the connected world.

Now we get an answer also for the “why” question number 2: the funders and managers act as the representatives of the society and listen for the requirements. Decision-makers are already in many countries requiring data to be managed and open (for instance NSF in USA with their requirement for the Data Management Plan).  However, as reported just recently by an expert group for the European Open Science Cloud  there is still ” an alarming lack of reproducibility of current published research”.  Thus after carefully listening the society decision-makers should increasingly ask for the learning and reproducibility layers as a prerequisite for positive funding agreements.

Incentivizing researchers via learning and communication settings

Now the last but not least “why” question number 1 concerning our researcher. Clearly, the availability of funding creates an incentive for the researcher to support reproducing of the research, and thus a proper research data management allowing to do so.  However, there is a bigger and better answer to the why question. Science is communication and so is learning. If we allow the researcher to move from the rather tedious task of “just research data management” to be able to allow others to learn (students, citizens, company people) how to in fact reproduce interesting research settings the picture is suddenly quite completely different.

Indeed, many researchers are also teachers and look for excellent ways for communicating what they feel is important for students to learn about. By creating a culture-shift towards online learning and reproducibility by utilizing excellent research data we thus create big incentives  for researchers to engage themselves in proper research data management.

Let us check some examples

As for examples there is the LODUM – Linked Open Data University of Münster project where we showed how to create the data infrastructure and the learning layer. The data created as part of LODUM has been in use by not only many student projects but also by new funded projects. As an example below is a visualization showing the amount of publications by university buildings (Keßler and Kauppinen, 2012).

lodum-productivity
Publications analyzed by buildings depict big differences among them.

Clearly creating of useful research data management schemes via opening linked data online calls for a culture-shift from traditional paper-as-the-end-result kind of publishing. To answer this call, Linked Open Science is an approach to enable interconnecting of scientific assets for allowing reproducibility and learning to happen.

Linked Open Science?

Linked Open Science (Kauppinen and de Espindola 2011) builds on the four key elements:

  • Linked Data: Input data, results and provenance information are published and archived using the Linked Data principles.
  • OpenSource and Web-based Environments: Methods are written for publication in open source environments.
  • Cloud Computing: The execution of methods and access to various resources are provided using the Cloud Computing approach.
  • Creative Commons: CC Licensing is in use to provide the legal and technical infrastructure for scientific assets.

This allows for creating of greater reproducibility environments where students and researchers can learn and explore new questions. In the context of complex phenomena such as the Brazilian Amazon Rainforest one can ask: How to link ecological, economical and social data? (Kauppinen et al. 2014)  What related processes can we evidence about the Brazilian Amazon Rainforest by interacting with visualizations? (Bartoschek et al. 2013). For this, tutorials of LinkedScience.org support  online learning.

gdp2005
An example visualization built on top of the Linked Brazilian Amazon Rainforest Data depicting the relation between GDP (the heights) and deforestation rates (red=more deforestation).

 

How does science work?

Further on, by studying scientific assets that are interconnected according to the Linked Science approach, it could perhaps be possible to find interesting laws about how science itself works. For instance, lately we analyzed data on 100 000 participations of scientists in conferences to reveal the associative nature of conference participation (Smiljanić, Chatterjee,  Kauppinen, Mitrović Dankulov 2016). See below a figure made to illustrate the idea in a visual way, and thus support learning about the research finding.

Storytelling via an example to illustrate the associative nature of conference participation
Here we illustrate the idea of the associative nature of conference participation via a simple example. Jim participated in a conference twice, then skipped one and participated once again, but did not participate at all after that. Tim participated the first five times and, although he skipped one conference, he then participated three times. The colors illustrate the likelihood to participate (red more probable, blue less probable).

To summarize

  • We need to focus on why-questions to find true incentives for different parties (researchers, decision-makers, citizens) to do and require proper research data management
  • As we discussed, learning is a great incentive as it requires good communication, and in essence often also reproducibility built on research data
  • Linked Open Science is an approach to interconnect scientific assets and to support reproducibility and learning
  • There is a big potential for research on understanding how science itself works by analyzing the traces left by researchers and scientific assets they produce.

Please contact via  @LinkedScience. The slides for this LEARN keynote are available online.

Links:

References

 

Opening Reproducible Research project is hiring

Opening Reproducible Research (ORR) project at the Institute for Geoinformatics, University of Münster, Germany has announced the following two open positions (deadline for applying October 15, 2015):

If your institute has open positions related to Open Science or Linked Science (or both!), please share news about them to us via @LinkedScience or tomi.kauppinen@aalto.fi and we will add them to LinkedScience.org/jobs.

 

Five papers accepted to COSIT workshop on Teaching Spatial Thinking

We accepted the following five papers to be presented at the Workshop on Teaching Spatial Thinking from Interdisciplinary Perspectives at COSIT2015:

Announcement by Tomi Kauppinen (co-chair), on behalf of the organizing committee.

Six papers accepted to Linked Science 2015

We are happy to announce that the following papers were accepted to this year’s Workshop on Linked Science organized at ISWC2015 in Bethlehem, Pennsylvania, USA on October 12th, 2015.

  • Tony Hammond and Michele Pasin. The nature.com ontologies portal
  • Da Huo, Jaroslaw Nabrzyski and Charles Vardeman. An Ontology Design Pattern towards Preservation of Computational Experiments
  • Carsten Keßler. Using the Web as a Data Source: Challenges for Linked Science
  • Tobias Kuhn. nanopub-java: A Java Library for Nanopublications
  • Paulo Pinheiro, Deborah McGuinness and Henrique Santos. Human-Aware Sensor Network Ontology: Semantic Support for Empirical Data Collection
  • Rui Yan, Brenda Praggastis, William Smith and Deborah McGuinness. Towards Cache Maintenance for Ontology Based, History-Aware Stream Reasoning

Announced by Tomi Kauppinen, Co-chair of the 5th Workshop on Linked Science 2015— Best Practices and the Road Ahead (LISC2015)

Teaching Spatial Thinking from Interdisciplinary Perspectives

Workshop on Teaching Spatial Thinking from Interdisciplinary Perspectives (SPATIALTHINKING2015)

When: October 12, 2015
Where: Santa Fe, New Mexico, USA
Collocated with Conference on Spatial Information Theory XII (COSIT 2015)
Workshop URI: http://linkedscience.org/events/spatialthinking2015/
Hashtag: #SpatialThinking2015

The “Teaching Spatial Thinking from Interdisciplinary Perspectives” (SPATIALTHINKING2015)  workshop’s goals are to:
1) Assist educators in developing interdisciplinary courses on spatial thinking.
2) Develop a repository of educational materials that educators could use to create interdisciplinary courses on spatial thinking.

Organizers of SPATIALTHINKING2015 are Heather Burte (UCSB), Tomi Kauppinen (Aalto Uni) and Mary Hegarty (UCSB).

Read more and welcome to join!

 

 

Tutorial on Visual Analytics at ESWC2015

We are happy to announce that we will arrange the Tutorial on Visual Analytics with Linked Open Data and Social Media (VisLOD2015) at ESWC2015 in Portoroz, Slovenia on May 31 or June 1, 2015.

In the tutorial we will focus on mining and visualizing of interesting spatial, temporal and thematic patterns from Linked Open Data and Social Media.

The teachers of the tutorial are Dr.  Suvodeep Mazumdar (Uni Sheffield), Dr  Tomi Kauppinen (Aalto Uni) and Dr.   Anna Lisa Gentile (Uni Sheffield).

[more information on ViSLOD2015…]

The program of #VISUAL2014

We are happy to announce the program of VISUAL2014 (International Workshop on Visualizations and User Interfaces for Knowledge Engineering and Linked Data Analytics).

When: November 24, 2014
Where: Linköping, Sweden
Collocated with EKAW2014, 19th International Conference on Knowledge Engineering and Knowledge Management
Workshop URI: http://linkedscience.org/events/visual2014/
Hashtag: #VISUAL2014

Workshop Program

09:15 – 09:25: Opening
09:25 – 10:30: Session I: Ontology Visualization (Session Chair: Valentina Ivanova)
09:25 – 09:50: A Vision for Diagrammatic Ontology Engineering (full paper), Gem Stapleton, John Howse, Adrienne Bonnington, Jim Burton
09:50 – 10:15: OntoViBe – An Ontology Visualization Benchmark (full paper), Florian Haag, Steffen Lohmann, Stefan Negru, Thomas Ertl
10:15 – 10:30: Discussion Session I

10:30 – 11:00: Coffee Break

11:00 – 12:30: Session II: User-Oriented Ontology Alignment (Session Chair: Steffen Lohmann)
11:00 – 11:20: What Can the Ontology Describe? Visualizing Local Coverage in PURO Modeler (short paper), Marek Dudas, Tomas Hanzal, Vojtech Svatek
11:20 – 11:45: User Involvement for Large-Scale Ontology Alignment (full paper), Valentina Ivanova, Patrick Lambrix
11:45 – 12:00: Discussion Session II
12:00 – 12:30: Wrap Up Sessions I+II

12:30 – 14:00: Lunch Break

14:00 – 15:00: Session III: Visual Approaches to Linked Data (Session Chair: Valentina Ivanova)
14:00 – 14:25: Sensemaking on Wikipedia by Secondary School Students with SynerScope (full paper), Willem Robert Van Hage, Fernando Nunez-Serrano, Thomas Ploeger, Jesper Hoeksema
14:25 – 14:45: Towards a Visual Annotation Tool for End-User Semantic Content Authoring (short paper), Torgeir Lebesbye, Ahmet Soylu
14:45 – 15:00: Discussion Session III

15:00 – 15:30: Coffee Break

15:30 – 17:00: Session IV: Demo Jam (Session Chair: Steffen Lohmann)
15:30 – 16:30: Impromptu demos (everyone is invited to join and present)
16:30 – 17:00: Wrap Up Sessions III+IV
17:00 – End of Workshop

Visualizing and Animating Large-scale Spatiotemporal Data with ELBAR Explorer

Visual exploration of data enables users and analysts observe interesting patterns that can trigger new research for further investigation. With the increasing availability of Linked Data, facilitating support for making sense of the data via visual exploration tools for hypothesis generation is critical. Time and space play important roles in this because of their ability to illustrate dynamicity, from a spatial context. Yet, Linked Data visualization approaches typically have not made efficient use of time and space together, apart from typical rather static multivisualization approaches and mashups. We developed ELBAR explorer that visualizes a vast amount of scientific observational data about the Brazilian Amazon Rainforest. The core contribution is a novel mechanism for animating between the different observed values, thus illustrating the observed changes themselves.

ELBAR-explorer will be demoed at ISWC2014 in October, 2014. The following paper will give more details:

Announcement: program of Geographic Information Observatories 2014, Vienna

We are happy to announce the program of the Workshop on Geographic Information Observatories 2014 to be organized at GIScience 2014 on September 23rd, 2014 in Vienna:

  • 08:30 – 09:00    Registration / Welcome to GIO2014
  • 09:00 – 09:15    Intro
  • 09:15 – 10:00    Keynote I: (Chair: Ben Adams)
    • Brent Hecht.The Mining and Application of Diverse Cultural Perspectives in Volunteered Geographic Information and User-Generated Content
  • 10:00 – 10:30    Coffee break
  • 10:30 – 11:15   Paper session I (10+5min per speaker) (Chair: Grant McKenzie)
    • Andrea Ballatore. Exploring the geographic information universe: The role of search technologies
    • Benjamin Adams, Mark Gahegan, Prashant Gupta and Richard Hosking. Geographic Information Observatories for Supporting Science
    • André Bruggmann and Sara Irina Fabrikant. Spatializing a Digital Text Archive about History
  • 11:15 – 12:30    Observatory Demos and Panel (Chair: Krzysztof Janowicz)
  • 12:30 – 13:30    Lunch
  • 13:30 – 14:15    Keynote II: (Chair: Krzysztof Janowicz)
    • Sven Schade.‘Post-Normal’ Geospatial Science
  • 14:15 – 15:00    Paper session II (10+5min per speaker) (Chair: Tomi Kauppinen)
    • Auriol Degbelo and Werner Kuhn. Five General Properties of Resolution
    • Heidelinde Hobel and Andrew U. Frank. Exploiting Linked Spatial Data and Granularity Transformations
    • Bandana Kar and Rina Ghose. Is My Information Private? Geo-Privacy in the World of Social Media
  • 15:00 – 15:30    Discussion about kinds of GIO (incl. infrastructure, community involvement, funding) (Chair: Ben Adams)
  • 15:30 – 16:00    Coffee break
  • 16:00 – 17:00    Break out groups on GIO research agenda (Chair: Tomi Kauppinen)
  • 17:00 – 17:30    Wrap-up of the results of the groups / joint writing  (Chair: Grant McKenzie)
  • 18:00 – 20:00    All-Workshop Icebreaker

Registration for the workshop and the conference is now open at the GIScience2014 web site (early bird until August 11th, 2014).

The workshop on Geographic Information Observatories 2014 is organized by Krzysztof Janowicz (University of California, Santa Barbara, USA), Ben Adams (University of Auckland, NZ), Grant McKenzie (University of California, Santa Barbara, USA) and
Tomi Kauppinen (University of Bremen, Germany and Aalto University School of Science, Finland)

Making Sense of Data is the theme of Linked Science 2014

We will organize the 4th Workshop on Linked Science 2014 (LISC2014) with the focus on theme Making Sense Out of Data. LISC2014 will collocate with ISWC2014 in Riva del Garda, Trentino, Italy on October 19 or 20, 2014.

We encourage submissions on both

  1. new results through making use of semantic reasoning or
  2. making innovative combination of existing technologies (such as visualization, data mining, machine learning, and natural language processing) with Semantic Web technologies to enable better understanding of data.

LISC2014 is organized by

– Jun Zhao, Lancaster University
– Marieke van Erp, VU University Amsterdam
– Carsten Keßler, Hunter College, City University of New York
– Tomi Kauppinen, University of Bremen
– Jacco van Ossenbruggen, CWI and
– Willem Robert van Hage, SynerScope B.V.

Please check the LISC2014 pages for more information and Call for Papers.