Discovering Characterizations of the Behavior of Anomalous Sub-populations - 2012


We consider the problem of discovering the attributes, or properties, that account for the a priori stated abnormality of a group of anomalous individuals (the outliers) with respect to an overall given population (the inliers). To this end, we introduce the notion of exceptional property and define the exceptionality score, which measures the significance of a property. To single out exceptional properties, we resort to a form of minimum distance estimation, which evaluates the badness of fit of the values assumed by the outliers against the probability distribution of the values assumed by the inliers. Suitable exceptionality scores are introduced for both numeric and categorical attributes. From both the analytical and the empirical point of view, these scores are designed to be effective for small samples, as is the case for outliers. We present an algorithm, called EXPREX, for efficiently discovering exceptional properties. The algorithm reduces the computational effort by exploring only relevant numerical intervals and by exploiting suitable pruning rules. The experimental results confirm that our technique provides knowledge that characterizes outliers in a natural manner.
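The abstract does not give the definitions of the exceptionality scores, so the following is only a minimal sketch of the general idea of measuring badness of fit between a small outlier sample and the inlier distribution. As a stand-in for the paper's numeric score it uses the one-sample Cramér-von Mises statistic computed against the empirical CDF of the inliers, and for categorical attributes it uses a plain frequency gap. The function names and both choices of distance are assumptions made for illustration; they are not EXPREX and not the authors' scores.

```python
from bisect import bisect_right
from collections import Counter


def empirical_cdf(sample):
    """Return F(x) = fraction of values in `sample` that are <= x."""
    ordered = sorted(sample)
    n = len(ordered)
    return lambda x: bisect_right(ordered, x) / n


def numeric_badness_of_fit(outlier_values, inlier_values):
    """Cramér-von Mises statistic of the outliers against the inlier CDF.

    Hypothetical stand-in for a numeric exceptionality score:
    larger values mean the outliers fit the inlier distribution worse.
    """
    cdf = empirical_cdf(inlier_values)
    ordered = sorted(outlier_values)
    m = len(ordered)
    total = 1.0 / (12 * m)
    for i, x in enumerate(ordered, start=1):
        # Compare the i-th expected quantile with the inlier CDF at x.
        total += ((2 * i - 1) / (2 * m) - cdf(x)) ** 2
    return total


def categorical_badness_of_fit(outlier_values, inlier_values, value):
    """Absolute gap between outlier and inlier frequency of one value.

    Hypothetical stand-in for a categorical exceptionality score.
    """
    out_freq = Counter(outlier_values)[value] / len(outlier_values)
    in_freq = Counter(inlier_values)[value] / len(inlier_values)
    return abs(out_freq - in_freq)


if __name__ == "__main__":
    inliers = list(range(1, 101))
    typical = [20, 50, 80]    # spread like the inliers
    extreme = [98, 99, 100]   # concentrated in the upper tail
    print(numeric_badness_of_fit(typical, inliers))
    print(numeric_badness_of_fit(extreme, inliers))
    print(categorical_badness_of_fit(["a", "a", "b"], ["a", "b", "b", "b"], "a"))
```

In this toy setting the three outliers concentrated in the upper tail receive a much larger statistic than the three that are spread like the inliers, which is the kind of signal a score of this family is meant to expose for a small outlier sample.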


