Dòng Nội dung
1
Big data : principles and best practices of scalable real-time data systems / Nathan Marz, James Warren.
Shelter Island, NY. : Manning, 2015
308 p. ; 23 cm.

Web-scale applications like social networks, real-time analytics, or e-commerce sites deal with a lot of data, whose volume and velocity exceed the limits of traditional database systems. These applications require architectures built around clusters of machines to store and process data of any size, or speed. Fortunately, scale and simplicity are not mutually exclusive. Big Data teaches you to build big data

2
Data mining : concepts and techniques / Jiawei Han, Micheline Kamber, Jian Pei
Burlington, MA : Elsevier, 2012
xxxv, 703 p. : ill. ; 25 cm.




3
Data mining : concepts and techniques / Jiawei Han, Micheline Kamber, Jian Pei.
Amsterdam ; Boston : Elsevier/Morgan Kaufmann, 2011
1 online resource (xxxv, 703 pages) : ill. ; 29 cm.

Not only does the third of edition of Data Mining: Concepts and Techniques continue the tradition of equipping you with an understanding and application of the theory and practice of discovering patterns hidden in large data sets, it also focuses on new, important topics in the field: data warehouses and data cube technology, mining stream, mining social networks, and mining spatial, multimedia and other complex data. Each chapter is a stand-alone guide to a critical topic, presenting proven algorithms and sound implementations ready to be used directly or with strategic modification against live data. This is the resource you need if you want to apply today's most powerful data mining techniques to meet real business challenges.

4
Data mining : the textbook / Charu C. Aggarwal
New York : Springer, 2015
xxix, 734 p. ; 180 ill. ; 24cm. :

This textbook explores the different aspects of data mining from the fundamentals to the complex data types and their applications, capturing the wide diversity of problem domains for data mining issues. It goes beyond the traditional focus on data mining problems to introduce advanced data types such as text, time series, discrete sequences, spatial data, graph data, and social networks. Until now, no single book has addressed all these topics in a comprehensive and integrated way. The chapters of this book fall into the following categories: Fundamental chapters: Data mining has four main problems, which correspond to clustering, classification, association pattern mining, and outlier analysis. These chapters comprehensively discuss a wide variety of methods for these problems; Domain chapters: These chapters discuss the specific methods used for different domains of data such as text data, time-series data, sequence data, graph data, and spatial data; Application chapters: These chapters study important applications such as stream mining, Web mining, ranking, recommendations, social networks, and privacy preservation. The domain chapters also have an applied flavo


5
The elements of statistical learning : Data mining, inference, and prediction / Trevor Hastie, Robert Tibshirani, J H Friedman.
New York : Springer, 2009
XXII, 745 p. : ill. (some color), charts ; 24 cm.

During the past decade there has been an explosion in computation and information technology. With it have come vast amounts of data in a variety of fields such as medicine, biology, finance, and marketing. The challenge of understanding these data has led to the development of new tools in the field of statistics, and spawned new areas such as data mining, machine learning, and bioinformatics. Many of these tools have common underpinnings but are often expressed with different terminology. This book describes the important ideas in these areas in a common conceptual framework. While the approach is statistical, the emphasis is on concepts rather than mathematics. Many examples are given, with a liberal use of color graphics. It is a valuable resource for statisticians and anyone interested in data mining in science or industry. The book's coverage is broad, from supervised learning (prediction) to unsupervised learning. The many topics include neural networks, support vector machines, classification trees and boosting---the first comprehensive treatment of this topic in any book.