Home
CFP
Submission
Dates
Program
Presentations
Register
Organisers

AusDM
Togaware


Invited and Accepted Papers

Keynotes

  • DATA = NORMAL + ANOMALOUS + NOISE
    Professor Sanjay Chawla

    Abstract: Our world at the micro, macro and personal level is now highly instrumented. A consequence of this instrumentation is that now it is possible to obtain fine-grained data about almost anything of interest. Once we focus on an application or a domain, it is reasonable to assume that much of the data obtained captures the "normal" behavior of the underlying phenomenon. Historically, "knowledge discovery," if any, has been triggered by the non-normal or anomalous part of the data. In this talk I will present some classic examples of data anomalies and how their discovery has changed our understanding of the world. Then I will present a modern and algorithmic viewpoint of anomaly detection as is currently practiced in the data mining community.

    Sanjay Chawla is the Professor of Pattern and Data Mining at the University of Sydney. He served as the Head of the School of Information Technologies from 2008-2011. His research work has appeared in leading international conferences and journals. Along with his students, he has received four best paper awards in the last six years. In 2012, he was the program co-chair of the 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD). He serves on the editorial board of the journal DMKD and is a member of the ACM.

  • Non-iidness: Coupled Object and Pattern Analysis
    Professor Longbing Cao

    Most of existing data mining algorithms are based on the IID assumption, which treats objects independently from each other. In the real world, objects are either loosely or tightly coupled with each other. For instance, a moving vehicle on the street interacts with the cars before and after it, and the ones on its left and right hand sides if any. In social networks, people interact with each other at different levels for varied purposes. Such interactions, or coupling relationships, are ubiquitous, and spread at various levels, between objects, between attributes describing an object, between attribute values within an attribute. It is crucial to cater for such relations in object analysis.

    On the other hand, the usual patterns identified by data mining are based on independent objects or items. For instance, often a large number of frequent patterns are mined by the existing algorithms, which are often treated as independent with each other. In fact, due to the object coupling relationships, patterns are associated with each other in structural and/or semantic aspects. Pattern relationship analysis is often ignored.

    In this talk, we will explore the needs, challenges, opportunities of analyzing complex object relations and complex pattern relations. On top of a framework for noniid-based coupled object and pattern analysis, several corresponding techniques will be introduced: coupled object analysis to define and quantify the coupling relationships within and between objects and within and between attributes, combined pattern mining to identify a group of patterns coupled by certain relationships. Coupled behavior analysis will be explored to analyse a group of actors’ behaviors. We will show how such new frameworks outperform the classic iid-based data mining framework in terms of handling complex data, behavior, relation, environment and pattern in clustering, frequent pattern mining, and classification. Several real-life applications will be given, such as the identification of group-based market manipulations in stock markets.

    Dr Longbing Cao is a professor of information technology at the University of Technology Sydney (UTS) Australia. He got PhD in Intelligent Sciences and PhD in Computing Science. He is the Founding Director of Advanced Analytics Institute, one of three university research institutes at UTS. AAI is the largest analytics research group in Australia, with the widest and deepest engagement with major local and overseas government and industry organizations in many sectors and domains. Prof Cao is also the Research Leader of the Data Mining Program at the Australian Capital Markets Cooperative Research Centre. He is a Senior Member of IEEE, SMC Society and Computer Society.

    Prof Cao's primary research interests include data mining and machine learning, and artificial intelligence and intelligent systems. He initiated and is leading the research on behavior informatics, domain driven data mining, agent mining, and open complex intelligent systems. He has published 2 monographs, 17 edited books and proceedings, 11 book chapters, and over 150 journal/conference publications. In recent years, his interest is mainly on non-iid based learning and mining, targeting coupled object analysis and pattern relation analysis.

    Coming from a seniorindustry leadership commitment, his following expertise and experience has been widely recognized in the academia, government, business and industry: enterprise applications of intelligent big data analytics and intelligent systems and decisions in areas such as customer analysis, fraud detection, outlier detection, risk management, and operational analysis in domains including social security, taxation, capital markets, online business, banking, telecommunication, health care, insurance, marketing, and so on in the real world.

Accepted Papers

  • Yun Sing Koh and Gill Dobbie
    Indirect Weighted Association Rules Mining for Academic Network Collaboration Recommendations
  • Yu-Shiang Hung, Kuei-Ling B. Chen, Chi-Ta Yang and Guang-Feng Deng
    Mining cluster-based patterns for elder self-care behavior
  • Dr Andrei Kelarev, Andrew Stranieri, John Yearwood, Herbert Jelinek and Jemal Abawajy
    Improving Classifications for Cardiac Autonomic Neuropathy Using Multi-level Ensemble Classifiers and Feature Selection Based on Random Forest
  • Muhammad Marwan Muhammad Fuad
    ABC-SG: A New Artificial Bee Colony Algorithm-Based Distance of Sequential Data Using Sigma Grams
  • Jonathan Wells, Kai Ming Ting and Chandrasiri Naiwala
    A non-time series approach to vehicle related time series problems
  • Mahmood Khan and Md. Zahidul Islam
    Evaluating the Performance of Several Data Mining Methods for Predicting Irrigation Water Requirement
  • Lavneet Singh and Girija Chetty
    A Novel Machine Learning Approach for Detecting the Brain Abnormalities from MRI Structural Images
  • Endang Djuana, Yue Xu and Yuefeng Li
    Learning Personalized Tag Ontology from User Tagging Information
  • Dang Bach Bui, Fedja Hadzic and Michael Hecker
    Application of Tree-structured Data Mining for Analysis of Process Logs in XML format
  • Md Anisur Rahman and Md Zahidul Islam
    CRUDAW: A Novel Fuzzy Technique for Clustering Records Following User Defined Attribute Weights
  • Kamal Taha
    GOtoGene: A Method for Determining the Functional Similarity among Gene Products
  • Muhammad Usman, Russel Pears and Alvis Fong
    Data Guided Approach to Generate Multi-dimensional Schema for Targeted Knowledge Discovery
  • Chao Sun, David Stirling, Christian Ritz and Claude Sammut
    Variance-wise Segmentation for a Temporal-Adaptive SAX
  • Dinusha Vatsalan and Peter Christen
    An Iterative Two-Party Protocol for Scalable Privacy-Preserving Record Linkage
  • T. L. Oshini Goonetilleke and H. A. Caldera
    Mining Life Insurance Data for Customer Attrition Analysis
  • Michael Mayo
    Cartesian Genetic Programming for Trading: A Preliminary Investigation
  • Jing Zhang, Derek Yu-Hsn Liu, Kok-Leong Ong, Zhijie Li and Ming Li
    Detecting Topic Labels for Tweets by Matching Features from Pseudo-Relevance Feedback
  • Hamid Ghous, Paul Kennedy, Daniel Catchpoole and Nicholas Ho
    Functional Visualisation of Genes using Singular Value Decomposition
  • Md. Sazzad Hussain, Hamed Monkaresi and Rafael A. Calvo
    Combining Classifiers in Multimodal Affect Detection
  • Peter Sunehag, Wen Shao and Marcus Hutter
    Coding of Non-Stationary Sources as a Foundation for Detecting Change Points and Outliers in Binary Time-Series
  • Qinxue Meng and Paul Kennedy
    Using network evolution theory and singular value decomposition method to improve accuracy of link prediction in social networks
  • Kaimin Yu, Zhe Li, Genliang Guan, Zhiyong Wang and David Feng
    Unsupervised Text Segmentation using LDA and MCMC
  • Helen Giggins and Ljiljana Brankovic
    VICUS - A Noise Addition Technique for Categorical Data
  • Satya Gautam Vadlamudi, Partha Pratim Chakrabarti and Sudeshna Sarkar
    Anytime Algorithms for Mining Groups with Maximum Coverage
  • Jialing Li, Li Li and Xiao Wen
    A Collaborative Filtering Recommendation System Combining Semantics and Bayesian Reasoning
  • Omar S. Soliman and Amr Adly
    Associative Classification using a Bio-Inspired Algorithm

For further information contact AusDM@togaware.com