IMI-BAS BAS
 

BulDML at Institute of Mathematics and Informatics >
IMI >
IMI Periodicals >
Serdica Journal of Computing >
2014 >
Volume 8 Number 2 >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10525/2433

Title: Seeking Relationships in Big Data: A Bayesian Perspective
Authors: Singpurwalla, Nozer
Keywords: Association
Correlation
Dependence
Mutual Information
Prediction
Regression
Retrospective Data
Issue Date: 2014
Publisher: Institute of Mathematics and Informatics Bulgarian Academy of Sciences
Citation: Serdica Journal of Computing, Vol. 8, No 2, (2014), 97p-110p
Abstract: The real purpose of collecting big data is to identify causality in the hope that this will facilitate credible predictivity . But the search for causality can trap one into infinite regress, and thus one takes refuge in seeking associations between variables in data sets. Regrettably, the mere knowledge of associations does not enable predictivity. Associations need to be embedded within the framework of probability calculus to make coherent predictions. This is so because associations are a feature of probability models, and hence they do not exist outside the framework of a model. Measures of association, like correlation, regression, and mutual information merely refute a preconceived model. Estimated measures of associations do not lead to a probability model; a model is the product of pure thought. This paper discusses these and other fundamentals that are germane to seeking associations in particular, and machine learning in general. ACM Computing Classification System (1998): H.1.2, H.2.4., G.3.
URI: http://hdl.handle.net/10525/2433
ISSN: 1312-6555
Appears in Collections:Volume 8 Number 2

Files in This Item:

File Description SizeFormat
sjc-vol8-num2-2014-p97-p110.pdf213.23 kBAdobe PDFView/Open

 



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0!   Creative Commons License