- Gebundene Ausgabe: 1096 Seiten
- Verlag: MIT Press (24. August 2012)
- Sprache: Englisch
- ISBN-10: 0262018020
- ISBN-13: 978-0262018029
- Größe und/oder Gewicht: 20,3 x 3,5 x 22,9 cm
- Durchschnittliche Kundenbewertung: 1 Kundenrezension
- Amazon Bestseller-Rang: Nr. 1.557 in Fremdsprachige Bücher (Siehe Top 100 in Fremdsprachige Bücher)
Machine Learning: A Probabilistic Perspective (Adaptive computation and machine learning.) (Englisch) Gebundene Ausgabe – 24. August 2012
|Neu ab||Gebraucht ab|
Wird oft zusammen gekauft
Kunden, die diesen Artikel gekauft haben, kauften auch
Es wird kein Kindle Gerät benötigt. Laden Sie eine der kostenlosen Kindle Apps herunter und beginnen Sie, Kindle-Bücher auf Ihrem Smartphone, Tablet und Computer zu lesen.
Geben Sie Ihre Mobiltelefonnummer ein, um die kostenfreie App zu beziehen.
"This comprehensive book should be of great interest to learners and practitioners in the field of machine learning." -- "British Computer Society"
Über den Autor und weitere Mitwirkende
Kevin P. Murphy is a Research Scientist at Google. Previously, he was Associate Professor of Computer Science and Statistics at the University of British Columbia.
Welche anderen Artikel kaufen Kunden, nachdem sie diesen Artikel angesehen haben?
The focus of the book is rather theoretical so you should probably be ready for that.
The writing however is very good and the exercises are helpful.
Die hilfreichsten Kundenrezensionen auf Amazon.com (beta)
Similar textbooks on statistical/probabilistic machine learning (links to book websites, not Amazon pages):
- Barber's Bayesian Reasoning and Machine Learning ("BRML", Cambridge University Press 2012)
- Koller and Friedman's Probabilistic Graphical Models ("PGM", MIT Press 2009)
- Bishop's Pattern Recognition and Machine Learning ("PRML", Springer 2006)
- MacKay's Information Theory, Inference and Learning Algorithms ("ITILA", CUP 2003)
- Hastie, Tibshirani and Friedman's Elements of Statistical Learning ("ESL", Springer 2009)
* Perspective: My perspective is that of a machine learning researcher and student, who has used these books for reference and study, but not as classroom textbooks.
* Audience/prerequisites: they are comparable among all the textbooks mentioned. BRML has lower expected commitment and specialization, PGM requires more scrupulous reading. The books differ in their topics and disciplinary approach, some more statistical (ESL), some more Bayesian (PRML, ITILA), some focused on graphical models (PGM, BRML). K Murphy compares MLAPP to others here. For detailed coverage comparison, read the table of contents on the book websites.
* Main strength: MLAPP stands out for covering more advanced and current research topics: there is a full chapter on Latent Dirichlet Allocation, learning to rank, L1 regularization, deep networks; in the basics, the decision theory part is quite thorough (e.g. will mention Jeffrey's/uninformative priors). The book is "open" and vivid, doesn't shy away from current research and advanced concepts. This seems to be purposeful, as it shows in many aspects:
- quotes liberally from web sources, something usually not done in academic publications
- borrows "the best" from other authors (always with permission and acknowledgment, of course): most importantly the best pictures and diagrams, but also tables, recaps, insightful diagrams. Whereas other books will produce their own pictures and diagrams themselves (eg, PRML has a distinctive clarity and style in its illustrations), MLAPP takes many of its colour illustrations from other people's publications; therefore it can select the most pithy and relevant pictures to make a point. You could think that reproductions may be illegible and require extra effort to interpret because they come from a variety of sources; I have found that the bonus coming from having precisely the right image prevails.
- frequent references to the literature, mentions of extensions and open questions, as well as computational complexity considerations: for instance, the section on HMMs will mention duration modeling and variable-duration Markov models, and a comparison of the expressive power of hierarchical HMMs versus stochastic context-free grammars, complete with relevant citations, and a brief mention of the computational complexity results from the publications. All this connects the material with research and new ideas in a fine way -- which other textbooks don't achieve, I find. For instance, PGM defers references to a literature section at the end of each chapter, resulting in a more self-contained, but more poorly "linked" text.
* Didactic aids: Another distinctive feature is that the author clearly has tried to include didactic aids gathered over the years, such as recaps, comparative tables, diagrams, much in the spirit of the "generative model of generative models" (Roweis and Ghahramani): e.g. table comparing all models discussed, pros and cons of generative vs. discriminative models, recap of operations on HMMs (smoothing, filtering etc), list of parameter estimation methods for CRFs.
* Editorial features: Other editorial features worth mentioning are
- compared to others, helpful mentions of terminology, e.g. jargon, nomenclature, concept names, in bold throughout the text ("you could also devise a variant thus; this is called so-and-so")
- mathematical notation relatively clear and consistent, occasional obscurities. PGM stands out as excruciatingly precise on this aspect.
- boxes/layout: no "skill boxes" or "case study boxes" (PGM), not many roadmap/difficulty indications like ITILA or PGM, examples are present but woven into the text (not separated like PGM or BRML). Layout rather plain and homogeneous, much like PRML.
- sadly lacks list of figures and tables, but has index of code
* Complete accompanying material:
- interesting exercises (yet fewer than PRML, BRML, PGM); solutions, however, are only accessible to instructors (same with BRML, PGM), which in my experience makes them only half as useful for the self-learner. PRML and ITILA have some solutions online resp. in the book.
- accompanying Matlab/Octave source code, which I found more readily usable than BRML's. PGM and PRML have no accompanying source code, even though the toolkit distributed with Koller's online PGM class might qualify as one. I find accompanying code a truly useful tool for learning; there's nothing like trying to implement an algorithm, checking your implementation against a reference, having boilerplate/utility code for the parts of the algorithm you're not interested in re-implementing. Also, code may clarify an algorithm, even when presented in pseudo-code. By the way, MLAPP has rather few pseudo-code boxes (like BRML or PRML, while PGM is very good here).
- MLAPP is not freely available as a PDF (unlike BRML, closest topic-wise, ESL, or ITILA). This will no doubt reduce its diffusion. My own take on the underlying controversy is in favor of distributing the PDF: makes successful books widely popular and cited (think ITILA or Rasmussen and Williams' Gaussian Processes), increases the book's overall value, equips readers with a weightless copy to annotate with e-ink, or consult on the go. I believe PDF versions positively impact sales, too: impact neutral-to-positive to course textbook/university library sales, indifferent to sales in countries with widely different purchase power, positive to all other segments due to enormous diffusion/popularity.
The closest contender to this book I believe is BRML. Both are excellent textbooks and have accompanying source code.
BRML is more accessible, has a free PDF version, and a stronger focus on graphical models.
MLAPP has all the qualities of an excellent graduate textbook (unified presentation, valuable learning aids), and yet is unafraid of discussing detail points (e.g. omnipresent results on complexity), as well as advanced and research topics (LDA, L1 regularization).
1. Take a look at the online Errata. This book is already in it's 3rd printing and it just came out. The list of corrections for this (the 3rd edition) is already mind-numbingly long. The 4th printing coming out this month will surely fix some errors, but there are just too many.
2. Our class has an online forum (for a 100 person class) where we discuss topics, and most questions are either (a) basic topics from the book that no one understood or (b) talking about how one figure in the book has multiple errors associated with it. At first I was really excited to find mistakes and submit them to the Errata - it was like I was part of the book! Now I just get frustrated and have already given up on submitting corrections.
3. Our instructor regrets using this book and modifies the examples before giving them to us in class. Our out of class readings now consist mostly of MetaAcademy.com.
4. There are hardly any worked-through examples, and many of those that are worked through have errors.
5. Many important concepts are skimmed over way too quickly. For example, there is a whole chapter on Logistic regression. However, Logistic regression is covered for exactly 2 pages. Then a weird 3D graph is presented but not explained (a common theme throughout the book is graphs that look absolutely amazing, but which convey little information as to exactly what's going on to a lay-person like me), then the rest of the chapter presents methods for doing the math, which I'm sure are useful in some sense, but I'm still thinking: "why is this MLE not in closed form, what is a Hessian doing here... and wtf is going on?!"
Most students just got the PDF for free online, and I would highly suggest doing something other than paying $55 for this book.
A. For somewhat theoretical approach to machine learning
1. If you have less than a month to study it: Read Learning From Data (~$30).
2. If you have a semester: Read Learning From Data along with lecture series by Yaser's on youtube.
B. For more applied approach to machine learning
1. If you have semester: Go through Andrew Ng's lecture series
For intermediate to advanced:
1. If you have a semester: Read this book.
Other classic machine learning textbooks, if you have more time:
1. Pattern Recognition And Machine Learning (The first book I read on Machine Learning. Very accessible. More detailed than Yasir's book, but less than Kevin's book)
2. The Nature of Statistical Learning Theory (Information Science and Statistics) by Vapnik (One of the pioneer in this field. Extremely theoretical approach)
3. Elements of Statistical Learning - Hastie et al (free pdf copy available)
One other thing, maybe again because it was a bit rushed, the book is not well organized. Most of the time you feel like the author took a bunch of sections written in different times and simply pasted them one after another. There is no coherent narrative that takes you through the text.
In short, although I appreciate the effort, I must say I'm disappointed with this book, especially given the hype about this book. I think this book needs some serious review, and first of all some proofreading.
However, the book suffers in its current form (my copy is the second printing) from the author's and publisher's hurry to get it in print. (The author admits, on the site devoted to the book, that such a rush existed.) The result is a book that, as other reviewers have noted, is less than perfectly organized and riddled with typographical and technical errors.
The errata notes on the author's site suggest that the third printing will be much improved over the first and second (including, among other things, an essentially rewritten chapter). My guess, though, is that it will take a good few years before the book's errors are altogether extirpated. (Aside: Why is it nearly impossible to determine the current printing of a book before you're actually holding it in your hands? This is pretty important if you're trying to get the cleanest copy possible.)
This book has all the ingredients of a five-star ML text. At the moment, though, it is hard to justify more than a three-star rating (especially given its hefty price). I suspect that my ultimate rating will be a function of the book's printing number, its exact shape depending largely on the diligence of the publisher, the author, and his readership.