AusDM12

Home
CFP
Submission
Dates
Program
Presentations
Register
Organisers

AusDM
Togaware

Invited and Accepted Papers

Keynotes

DATA = NORMAL + ANOMALOUS + NOISE
Professor Sanjay Chawla
Abstract: Our world at the micro, macro and personal level is now highly instrumented. A consequence of this instrumentation is that now it is possible to obtain fine-grained data about almost anything of interest. Once we focus on an application or a domain, it is reasonable to assume that much of the data obtained captures the "normal" behavior of the underlying phenomenon. Historically, "knowledge discovery," if any, has been triggered by the non-normal or anomalous part of the data. In this talk I will present some classic examples of data anomalies and how their discovery has changed our understanding of the world. Then I will present a modern and algorithmic viewpoint of anomaly detection as is currently practiced in the data mining community.
Sanjay Chawla is the Professor of Pattern and Data Mining at the University of Sydney. He served as the Head of the School of Information Technologies from 2008-2011. His research work has appeared in leading international conferences and journals. Along with his students, he has received four best paper awards in the last six years. In 2012, he was the program co-chair of the 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD). He serves on the editorial board of the journal DMKD and is a member of the ACM.
Non-iidness: Coupled Object and Pattern Analysis
Professor Longbing Cao
Most of existing data mining algorithms are based on the IID assumption, which treats objects independently from each other. In the real world, objects are either loosely or tightly coupled with each other. For instance, a moving vehicle on the street interacts with the cars before and after it, and the ones on its left and right hand sides if any. In social networks, people interact with each other at different levels for varied purposes. Such interactions, or coupling relationships, are ubiquitous, and spread at various levels, between objects, between attributes describing an object, between attribute values within an attribute. It is crucial to cater for such relations in object analysis.
On the other hand, the usual patterns identified by data mining are based on independent objects or items. For instance, often a large number of frequent patterns are mined by the existing algorithms, which are often treated as independent with each other. In fact, due to the object coupling relationships, patterns are associated with each other in structural and/or semantic aspects. Pattern relationship analysis is often ignored.
In this talk, we will explore the needs, challenges, opportunities of analyzing complex object relations and complex pattern relations. On top of a framework for noniid-based coupled object and pattern analysis, several corresponding techniques will be introduced: coupled object analysis to define and quantify the coupling relationships within and between objects and within and between attributes, combined pattern mining to identify a group of patterns coupled by certain relationships. Coupled behavior analysis will be explored to analyse a group of actors’ behaviors. We will show how such new frameworks outperform the classic iid-based data mining framework in terms of handling complex data, behavior, relation, environment and pattern in clustering, frequent pattern mining, and classification. Several real-life applications will be given, such as the identification of group-based market manipulations in stock markets.
Dr Longbing Cao is a professor of information technology at the University of Technology Sydney (UTS) Australia. He got PhD in Intelligent Sciences and PhD in Computing Science. He is the Founding Director of Advanced Analytics Institute, one of three university research institutes at UTS. AAI is the largest analytics research group in Australia, with the widest and deepest engagement with major local and overseas government and industry organizations in many sectors and domains. Prof Cao is also the Research Leader of the Data Mining Program at the Australian Capital Markets Cooperative Research Centre. He is a Senior Member of IEEE, SMC Society and Computer Society.
Prof Cao's primary research interests include data mining and machine learning, and artificial intelligence and intelligent systems. He initiated and is leading the research on behavior informatics, domain driven data mining, agent mining, and open complex intelligent systems. He has published 2 monographs, 17 edited books and proceedings, 11 book chapters, and over 150 journal/conference publications. In recent years, his interest is mainly on non-iid based learning and mining, targeting coupled object analysis and pattern relation analysis.
Coming from a seniorindustry leadership commitment, his following expertise and experience has been widely recognized in the academia, government, business and industry: enterprise applications of intelligent big data analytics and intelligent systems and decisions in areas such as customer analysis, fraud detection, outlier detection, risk management, and operational analysis in domains including social security, taxation, capital markets, online business, banking, telecommunication, health care, insurance, marketing, and so on in the real world.

Accepted Papers

Yun Sing Koh and Gill Dobbie
Indirect Weighted Association Rules Mining for Academic Network Collaboration Recommendations
Yu-Shiang Hung, Kuei-Ling B. Chen, Chi-Ta Yang and Guang-Feng Deng
Mining cluster-based patterns for elder self-care behavior
Dr Andrei Kelarev, Andrew Stranieri, John Yearwood, Herbert Jelinek and Jemal Abawajy
Improving Classifications for Cardiac Autonomic Neuropathy Using Multi-level Ensemble Classifiers and Feature Selection Based on Random Forest
Muhammad Marwan Muhammad Fuad
ABC-SG: A New Artificial Bee Colony Algorithm-Based Distance of Sequential Data Using Sigma Grams
Jonathan Wells, Kai Ming Ting and Chandrasiri Naiwala
A non-time series approach to vehicle related time series problems
Mahmood Khan and Md. Zahidul Islam
Evaluating the Performance of Several Data Mining Methods for Predicting Irrigation Water Requirement
Lavneet Singh and Girija Chetty
A Novel Machine Learning Approach for Detecting the Brain Abnormalities from MRI Structural Images
Endang Djuana, Yue Xu and Yuefeng Li
Learning Personalized Tag Ontology from User Tagging Information
Dang Bach Bui, Fedja Hadzic and Michael Hecker
Application of Tree-structured Data Mining for Analysis of Process Logs in XML format
Md Anisur Rahman and Md Zahidul Islam
CRUDAW: A Novel Fuzzy Technique for Clustering Records Following User Defined Attribute Weights
Kamal Taha
GOtoGene: A Method for Determining the Functional Similarity among Gene Products
Muhammad Usman, Russel Pears and Alvis Fong
Data Guided Approach to Generate Multi-dimensional Schema for Targeted Knowledge Discovery
Chao Sun, David Stirling, Christian Ritz and Claude Sammut
Variance-wise Segmentation for a Temporal-Adaptive SAX
Dinusha Vatsalan and Peter Christen
An Iterative Two-Party Protocol for Scalable Privacy-Preserving Record Linkage
T. L. Oshini Goonetilleke and H. A. Caldera
Mining Life Insurance Data for Customer Attrition Analysis
Michael Mayo
Cartesian Genetic Programming for Trading: A Preliminary Investigation
Jing Zhang, Derek Yu-Hsn Liu, Kok-Leong Ong, Zhijie Li and Ming Li
Detecting Topic Labels for Tweets by Matching Features from Pseudo-Relevance Feedback
Hamid Ghous, Paul Kennedy, Daniel Catchpoole and Nicholas Ho
Functional Visualisation of Genes using Singular Value Decomposition
Md. Sazzad Hussain, Hamed Monkaresi and Rafael A. Calvo
Combining Classifiers in Multimodal Affect Detection
Peter Sunehag, Wen Shao and Marcus Hutter
Coding of Non-Stationary Sources as a Foundation for Detecting Change Points and Outliers in Binary Time-Series
Qinxue Meng and Paul Kennedy
Using network evolution theory and singular value decomposition method to improve accuracy of link prediction in social networks
Kaimin Yu, Zhe Li, Genliang Guan, Zhiyong Wang and David Feng
Unsupervised Text Segmentation using LDA and MCMC
Helen Giggins and Ljiljana Brankovic
VICUS - A Noise Addition Technique for Categorical Data
Satya Gautam Vadlamudi, Partha Pratim Chakrabarti and Sudeshna Sarkar
Anytime Algorithms for Mining Groups with Maximum Coverage
Jialing Li, Li Li and Xiao Wen
A Collaborative Filtering Recommendation System Combining Semantics and Bayesian Reasoning
Omar S. Soliman and Amr Adly
Associative Classification using a Bio-Inspired Algorithm

For further information contact AusDM@togaware.com