Difference between revisions of "SSDM 2017"

From Openresearch
Jump to: navigation, search
Line 39: Line 39:
  
 
==Workshop Chairs==
 
==Workshop Chairs==
  Wilhelmiina Hämäläinen, Academy of Finland/Department of Computer Science, Aalto University, Finland.
+
  ** [[has workshop chair::Wilhelmiina Hämäläinen]], Academy of Finland/Department of Computer Science, Aalto University, Finland
firstname.lastname@gmail.com Replace 'ä'('a with two dots') by 'a'
+
Wilhelmiina Hämäläinen, Academy of Finland/Department of Computer Science, Aalto University, Finland.
  Geoff Webb, Faculty of Information Technology, Monash University, Australia.
+
  ** [[has workshop chair:: ** [[has workshop chair::Wilhelmiina Hämäläinen]],]], Faculty of Information Technology, Monash University, Australia
firstname.lastname@monash.edu
+
 
  
 
==Programme Committee==
 
==Programme Committee==
Peter Flach, University of Bristol, UK
+
** [[has workshop chair::Peter Flach]], University of Bristol, UK
Wilhelmiina Hämäläinen, Aalto University, Finland
+
** [[has workshop chair::Wilhelmiina Hämäläinen]], Aalto University, Finland
Florian Lemmerich, University of Würzburg, Germany
+
** [[has workshop chair::Florian Lemmerich]], University of Würzburg, Germany
Cecile Low-Kam,  Montreal Heart Institute, Canada
+
** [[has workshop chair::Cecile Low-Kam]],  Montreal Heart Institute, Canada
Siegfried Nijssen,  Leiden University, Netherlands
+
** [[has workshop chair::Siegfried Nijssen]],  Leiden University, Netherlands
Francois Petitjean, Monash University, Australia
+
** [[has workshop chair::Francois Petitjean]], Monash University, Australia
Chedy Raissi, INRIA, France
+
** [[has workshop chair::Chedy Raissi]], INRIA, France
Jan Ramon, INRIA, France
+
** [[has workshop chair::Chedy Raissi]], INRIA, France
Jun Sese, AIST, CBRC, Japan
+
** [[has workshop chair::Jun Sese]], AIST, CBRC, Japan
Koji Tsuda, University of Tokyo/AIST, Japan
+
** [[has workshop chair::Koji Tsuda]], University of Tokyo/AIST, Japan
Geoff Webb, Monash University, Australia
+
** [[has workshop chair::Geoff Webb]], Monash University, Australia

Revision as of 23:34, 19 September 2016

SSDM 2017
SSDM 2017 : The 2nd ECML/PKDD 2016 workshop on Statistically Sound Data Mining
Dates 2016/09/19 (iCal) - 2016/09/19
Homepage: sites.google.com/site/whamalaipages/ssdm2016
Location
Location: Riva del Garda, Italy
Loading map...

Important dates
Submissions: 2016/07/17
Notification: 2016/07/25
Table of Contents


Motivation and objectives

Following on from the previous successful ECML/PKDD workshop SSDM'14 we will again bring together researchers in this significant and topical field.

The field of statistics has developed sophisticated, well-founded methods for inference from data. While some of these place computational or practical limits that make them infeasible to apply directly to many data mining problems, the field of data mining has much to gain from a more sophisticated understanding of the strengths and limitations of these techniques and from greater utilization of them where they are appropriate.

As an answer to this dilemma, there is emerging a clear trend towards statistically sound data mining. The main impetus for this new trend is coming from a third party, the application fields. In the computerized world, it is easy to collect large data sets but their analysis is more difficult. Knowing the traditional statistical tests is no more sufficient for scientists, because one should first find the most promising hidden patterns and models to be tested. This means that there is an urgent need for efficient data mining algorithms which are able to find desired patterns, without missing any significant discoveries or producing too many spurious ones. A related problem is to find a statistically justified compromise between underfitted (too generic to catch all important aspects) and overfitted (too specific, holding just due to chance) patterns. However, before any algorithms can be designed, one should first solve many principal problems, like how to define the statistical significance of desired patterns, how to evaluate overfitting, how to interprete the p-values when multiple patterns are tested, and so on. In addition, one should evaluate the existing data mining methods, alternative algorithms and goodness measures to see which of them produce statistically valid results.

As we can see, there are many important problems which should be worked together with people from Data mining, Machine learning, and Statistics as well as application fields. The goal of this workshop is to offer a meeting point for this discussion. We want to bring together people from different backgrounds and schools of science, both theoretically and practically oriented, to specify problems, share solutions and brainstorm new ideas.

To encourage real workshopping of actual problems, the workshop is arranged in a novel way, containing an invited lecture and inspiring groupworks in addition to traditional presentations. This means that also the non-author participants can contribute to workshop results. If you have relevant problems which you would like to be worked together in the workshop, please send them before the workshop.

Topics of Interest

Topics of interest include but are not limited to:

Useful and relevant theoretical results
Search methods for statistically valid patterns and models
Statistical validation of discovered patterns
Evaluating statistical significance of clustering
Statistical techniques for avoiding overfitted patterns
Scaling statistical techniques to high-dimensionality and high data quantity, covering both theoretical problems (like multiple testing problem) and computational problems (calculating required test measures efficiently)
Interesting applications with real world data demonstrating statistically sound data mining
Empirical comparisons between between different statistical validation methods and possibly other goodness measures

Insightful positition papers

We particularly encourage submissions which compare different schools of statistics, like frequentist (Neyman-Pearsonian or Fisherian) vs. Bayesian, or analytic vs. empirical significance testing. Equally interesting are submissions introducing generic school-independent computational methods. You can also submit papers describing works-in-progress.

Workshop Chairs

** Wilhelmiina Hämäläinen, Academy of Finland/Department of Computer Science, Aalto University, Finland

Wilhelmiina Hämäläinen, Academy of Finland/Department of Computer Science, Aalto University, Finland.

**

" ** [[has workshop chair" contains the "[" character as part of a property label and has been classified as invalid.

,]], Faculty of Information Technology, Monash University, Australia


Programme Committee

** Peter Flach, University of Bristol, UK
** Wilhelmiina Hämäläinen, Aalto University, Finland
** Florian Lemmerich, University of Würzburg, Germany
** Cecile Low-Kam,  Montreal Heart Institute, Canada
** Siegfried Nijssen,  Leiden University, Netherlands
** Francois Petitjean, Monash University, Australia
** Chedy Raissi, INRIA, France
** Chedy Raissi, INRIA, France
** Jun Sese, AIST, CBRC, Japan
** Koji Tsuda, University of Tokyo/AIST, Japan
** Geoff Webb, Monash University, Australia
Facts about "SSDM 2017"
AcronymSSDM 2017 +
End dateSeptember 19, 2016 +
Event typeConference +
Has coordinates45° 53' 5", 10° 50' 23"Latitude: 45.884769444444
Longitude: 10.839694444444
+
Has location cityRiva del Garda +
Has location countryCategory:Italy +
Homepagehttp://sites.google.com/site/whamalaipages/ssdm2016 +
IsAEvent +
NotificationJuly 25, 2016 +
Start dateSeptember 19, 2016 +
Submission deadlineJuly 17, 2016 +
TitleSSDM 2017 : The 2nd ECML/PKDD 2016 workshop on Statistically Sound Data Mining +