MARC View

000			02377cam a22002418i 4500
005			20250421080607.0
008			250213s2020 nyua b 001 0 eng
020			_a9781108476348 (hardback)
037			_bSBF2024
040			_aDLC _beng _cDLC _dAE-ShU _cUKB
050	0	0	_aHF5415.125 _b.L46 2020
100	1		_aLeskovec, Jurij, _eauthor.
245	1	0	_aMining of massive datasets / _cJure Leskovec, Stanford University, Anand Rajaraman, Rocketship VC, Jeffrey David Ullman, Stanford University.
250			_aThird edition.
260			_aNew York, NY : _bCambridge University Press, _c2020.
300			_axi, 553 pages : _billustrations ; _c26 cm
500			_aIncludes index.
520			_a"The Web, social media, mobile activity, sensors, Internet commerce, and many other modern applications provide many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be used on even the largest datasets. It begins with a discussion of the MapReduce framework and related techniques for efficient parallel programming. The tricks of locality-sensitive hashing are explained. This body of knowledge, which deserves to be more widely known, is essential when seeking similar objects in a very large collection without having to compare each pair of objects. Stream-processing algorithms for mining data that arrives too fast for exhaustive processing are also explained. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering, each from the point of view that the data is too large to fit in main memory. Two applications: recommendation systems and Web advertising, each vital in e-commerce, are treated in detail. Later chapters cover algorithms for analyzing social-network graphs, compressing large-scale data, and machine learning. This third edition includes new and extended coverage on decision trees, deep learning, and mining social-network graphs. Written by leading authorities in database and Web technologies, it is essential reading for students and practitioners alike"-- _cProvided by publisher.
650		0	_aData mining.
700	1		_aRajaraman, Anand, _eauthor.
700	1		_aUllman, Jeffrey D., _d1942- _eauthor.
942			_cBKS
999			_c19335 _d19335