120 likes | 232 Vues
Short TC Report for the 2013 June AdCom Meeting (June 20, 2013). Data Mining Technical Committee (DMTC). Chair: Barbara Hammer Vice-Chairs: Carlotta Domeniconi Zhi-Hua Zhou. Short Overview. 41 members (15 new ones) main conference CIDM at SSCI’13 (46 papers out of 80 accepted)
E N D
Short TC Report for the 2013 June AdCom Meeting (June 20, 2013) Data Mining Technical Committee (DMTC) Chair: Barbara Hammer Vice-Chairs: Carlotta Domeniconi Zhi-Hua Zhou
Short Overview • 41 members (15 new ones) • main conference CIDM at SSCI’13 (46 papers out of 80 accepted) • 8 task forces (1 new: industrial applications) • DMTC meeting at CIDM’13 in Singapore • > 70 activities fostering DM as concerns organization of conferences, workshops, editing of books, special issues, … • In this short note, focus lies on two spotlights: • New symposium : Computational Intelligence for Big Data • Effort of task force on Process Mining: XES standardization
New Symposium on Big Data: CIBD at SSCI’14 • discussion at DMTC meeting at CIDM’13: • big data is hot topic, • but it is integral to DM in general, so not clear whether a separate symposium necessary/beneficial • discussions among SSCI’14 main chair (Haibo He), DMTC chairs (Barbara Hammer, Zhi-Hua Zhou, Carlotta Domeniconi), chair of task force on big data (Nitesh Chawla), potential chairs of CIBD Symposium (Yaochu Jin, Yonghong Peng), Hisao Ishibuchi • CIBD‘14: • There will be a separate symposium on big data, • co-organized by Yaochu Jin, Yonghong Peng, Nitesh Chawla, Marios Polycarpou at SSCI’14, • strongly supported by DMTC
XES standardization proposal driven by TF on process mining • Task force on Process Mining: • chair: Wil van der Aalst, homepage: http://www.win.tue.nl/ieeetfpm/doku.php • > 70 members, activities could make up an own TC • very strong interest from industry
XES standardization proposal driven by TF on process mining • What is Process Mining? • A small sample process model • and a small (!) • real one: • Examples: • company • production • web services • health care • administration • curricula • ...
XES standardization proposal driven by TF on process mining • Process = sequence of events • Process model = model which covers all possible sequences of events (and only those) • Process models are essential to structure, maintain, and document processes and supporting software • Problems: • no models • wrong models (not consistent with data) • inefficient processes • Process Mining: use data mining techniques to • infer process models (play in) • verify process models (play out, conformance check) • inspect process models and optimize processes (replay)
XES standardization proposal driven by TF on process mining • Process models and event logs: • Event log = set of sequences • of events and their characteristics
XES standardization proposal driven by TF on process mining • Process models and event logs: discovery algorithms such as alpha miner, heuristic miner, ... conformance checking mining ProM processmodel event log XES: standard to represent event log data businessprocessmodelling process
<?xml version="1.0" encoding="UTF-8" ?> <!-- XES version 1.0 --> <!-- Created by Fluxicon Nitro (http://fluxicon.com/nitro/ --> <!-- (c) 2010 Fluxicon Process Laboratories / http://fluxicon.com/ --> <log xes.version="1.0" xmlns="http://code.deckfour.org/xes" xes.creator="Fluxicon Nitro"> <extension name="Concept" prefix="concept" uri="http://code.deckfour.org/xes/concept.xesext"/> <extension name="Time" prefix="time" uri="http://code.deckfour.org/xes/time.xesext"/> <extension name="Organizational" prefix="org" uri="http://code.deckfour.org/xes/org.xesext"/> <global scope="trace"> <string key="concept:name" value="name"/> </global> <global scope="event"> <string key="concept:name" value="name"/> <string key="org:resource" value="resource"/> <date key="time:timestamp" value="2011-04-13T18:27:00.515+02:00"/> <string key="Activity" value="string"/> <string key="Resource" value="string"/> <string key="Costs" value="string"/> </global> <classifier name="Activity" keys="Activity"/> <classifier name="activity classifier" keys="Activity"/> <string key="creator" value="Fluxicon Nitro"/> <trace> <string key="concept:name" value="1"/> <string key="creator" value="Fluxicon Nitro"/> <event> <string key="concept:name" value="register request"/> <string key="org:resource" value="Pete"/> <date key="time:timestamp" value="2010-12-30T11:02:00.000+01:00"/> <string key="Activity" value="register request"/> <string key="Resource" value="Pete"/> <string key="Costs" value="50"/> </event> ... </trace> <trace> ... </trace> </log>
XES standardization proposal driven by TF on process mining • Pros: • standard to represent data for diverse process discovery algorithms • standard to represent data for play-out and analysis such as comparison, visualization • standard for representation of data stemming from business process models, possibility of data transformation for important systems currently in use (such as SAP) • extendable
XES standardization proposal driven by TF on process mining • XES – eXtensible Event Stream for interoperability in event logs • process mining as extremely relevant, relatively novel area of DM, combine process models with data in a principled way • backed up by public source extensible software such as ProM • XES closes the gap of data exchange which is extremely problematic in current applications / systems • a core group of about ten people already waits as volunteers to write the XES standardization proposal • strongly supported by and DMTC
DMTC chair’s plan for 2013/14 • encourage TC members and TF Chairs to propose tutorials, workshops, panels, competitions for SSCI 2014 and WCCI 2014 • encourage TC members and TF Chairs to propose special issues on CIS sponsored journals • foster task force Big Data, foster dedicated symposium CIBD • foster cross-links of TFs • foster balance of DMTC, TF members as regards geographic regions and gender • encourage members to think about new task forces, think about achievable tasks for existing ones