登入
選單
返回
Google圖書搜尋
Mining Imperfect Data
Ronald K. Pearson
其他書名
Dealing with Contamination and Incomplete Records
出版
SIAM
, 2005-01-01
主題
Computers / Database Administration & Management
Mathematics / Probability & Statistics / General
Mathematics / Applied
Computers / Data Science / Data Analytics
ISBN
0898717884
9780898717884
URL
http://books.google.com.hk/books?id=r0LmgeMMWmEC&hl=&source=gbs_api
EBook
SAMPLE
註釋
Data mining is concerned with the analysis of databases large enough that various anomalies, including outliers, incomplete data records, and more subtle phenomena such as misalignment errors, are virtually certain to be present. Mining Imperfect Data describes in detail a number of these problems, as well as their sources, their consequences, their detection, and their treatment. Specific strategies for data pretreatment and analytical validation that are broadly applicable are described, making them useful in conjunction with most data mining analysis methods. Examples are presented to illustrate the performance of the pretreatment and validation methods in a variety of situations, both simulation based, where "correct" results are known unambiguously, and real data examples that illustrate typical cases met in practice.