Facebook Twitter Pinterest
  • Statt: EUR 32,00
  • Sie sparen: EUR 1,53 (5%)
  • Alle Preisangaben inkl. MwSt.
Nur noch 3 auf Lager (mehr ist unterwegs).
Verkauf und Versand durch Amazon. Geschenkverpackung verfügbar.
Agile Data Science: Build... ist in Ihrem Einkaufwagen hinzugefügt worden
+ EUR 3,00 Versandkosten
Gebraucht: Sehr gut | Details
Verkauft von Buchantiquariat
Zustand: Gebraucht: Sehr gut
Kommentar: Buch ist unbenutzt und ungelesen, als Mängelexemplar gekennzeichnet!
Möchten Sie verkaufen?
Zur Rückseite klappen Zur Vorderseite klappen
Hörprobe Wird gespielt... Angehalten   Sie hören eine Hörprobe des Audible Hörbuch-Downloads.
Mehr erfahren
Alle 2 Bilder anzeigen

Agile Data Science: Building Data Analytics Applications with Hadoop (Englisch) Taschenbuch – 22. Oktober 2013

Alle Formate und Ausgaben anzeigen Andere Formate und Ausgaben ausblenden
Neu ab Gebraucht ab
Kindle Edition
"Bitte wiederholen"
"Bitte wiederholen"
EUR 30,47
EUR 20,40 EUR 12,90
19 neu ab EUR 20,40 11 gebraucht ab EUR 12,90
click to open popover

Wird oft zusammen gekauft

  • Agile Data Science: Building Data Analytics Applications with Hadoop
  • +
  • Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Google+, GitHub, and More
Gesamtpreis: EUR 65,96
Die ausgewählten Artikel zusammen kaufen

Es wird kein Kindle Gerät benötigt. Laden Sie eine der kostenlosen Kindle Apps herunter und beginnen Sie, Kindle-Bücher auf Ihrem Smartphone, Tablet und Computer zu lesen.

  • Apple
  • Android
  • Windows Phone

Geben Sie Ihre Mobiltelefonnummer ein, um die kostenfreie App zu beziehen.

Jeder kann Kindle Bücher lesen — selbst ohne ein Kindle-Gerät — mit der KOSTENFREIEN Kindle App für Smartphones, Tablets und Computer.



Über den Autor und weitere Mitwirkende

Russell Jurney cut his data teeth in casino gaming, building web apps to analyze the performance of slot machines in the US and Mexico. After dabbling in entrepreneurship, interactive media and journalism, he moved to silicon valley to build analytics applications at scale at Ning and LinkedIn. He lives on the ocean in Pacifica, California with his wife Kate and two fuzzy dogs.


Es gibt noch keine Kundenrezensionen auf Amazon.de
5 Sterne
4 Sterne
3 Sterne
2 Sterne
1 Stern

Die hilfreichsten Kundenrezensionen auf Amazon.com (beta)

Amazon.com: HASH(0x97ab593c) von 5 Sternen 8 Rezensionen
16 von 18 Kunden fanden die folgende Rezension hilfreich
HASH(0x95f7a540) von 5 Sternen Absolute required reading for all new data scientists 3. Januar 2014
Von chad - Veröffentlicht auf Amazon.com
Format: Taschenbuch Verifizierter Kauf
I was once told by a chief data scientist that they would rather teach a mathematician programming than a programmer math (to be a data scientist). After being a data scientist for some time now I would have to respectfully disagree. 85% of data science is plumbing and I wouldn't hire a physicist to be a plumber. Indeed modern data scientists really do need to be full-stack developers trapped in an academic's body.
Jurney nails it! He offers tools and methodologies adapted to common data science workflows and their associated pitfalls wherein we spend 85% of our time plumbing and 15% of our time integrating some off-the-shelf algorithm to find deep insight.
So, for new data scientists or 3rd-4th year grad students who have balanced their Twitter API hack with NSF grant deadlines, this is ABSOLUTELY REQUIRED READING.
15 von 17 Kunden fanden die folgende Rezension hilfreich
HASH(0x95f7a78c) von 5 Sternen He sent this with NON-WORKING CODE 9. Juni 2014
Von SF - Veröffentlicht auf Amazon.com
Format: Kindle Edition Verifizierter Kauf
The story is nice, but the code that forms the basis of the entire project behind the book DOESN'T COMPILE. The author has - as of today (June 9, 2014) - removed all of the github references to the project.

I"m half way through the book, have been practicing Agile development techniques for several years, and I am not quite sure what in particular makes this book about Data Science 'Agile' based.

One thing that he does nicely is explain the Pig code he uses, but I can't use those programs because the Python programs that gather the data that feed Pig will not compile, even after I de-bugged his code for several hours. (Example: the author made reference to an RFC inline in the Python code that would have NEVER compiled. NEVER. Line 11 gmail.py from call to email utilitiies)
11 von 12 Kunden fanden die folgende Rezension hilfreich
HASH(0x95f7a750) von 5 Sternen Chapter 3 alone is worth the price of the entire book. 5. Februar 2014
Von Carsten Jørgensen - Veröffentlicht auf Amazon.com
Format: Taschenbuch
Book review - Agile Data Science by Russell Jurney, O'Reilly Media

The subtitle "Building Data Analytics Applications with Hadoop" of this book says more about the book than the actual title "Agile Data Science". However the subtitle will probably fool most people. Before reading this book I believed that Hadoop with the the distributed file-system HDFS. If you are looking for a book about building applications on the of HDFS then this book IS NOT for you. It turns out that Hadoop is much more than just HDFS.

Do not buy this book for learning about agile software development methodologies. There are some rather strange comments about personal and private space requirement for creative workers as well as mentioning of "Easy access to large-format printing is a requirement for the agile environment." The discussion about agile methods for working with data science is interesting. The basic question is if it is possible to bridge agile methods and data science since science in it's nature does not consists of a predefined set of tasks. It seems to me that the tools and software used in chapter 3 are called agile an hence is the process agile. In part II of the book the application build is chapter 3 is refined in a number of steps that the author calls iterative. But again, that does not make the process agile. I am not saying that the author is wrong but the point about the agile method and how process and tools interact to make the development agile is not entirely clear to me.

This is NOT a book about the inner workings of Hadoop. Please refer to "Hadoop: The Definitive Guide" by Tom White for O'Reilly Media for a thorough introduction to Hadoop. Instead the book takes a very practical approach and show us how to build agile applications using various Hadoop components like Pig, MapReduce, and the Avro serialization framework. In addition you will see how to move data into the popular noSQL database MongoDB and how to use ElasticSearch to search the data. Finally, all the collected data is accesses through a lightweight web application build with Python and Flask with visual enhancement made in Bootstrap and D3.

Agile Data Science covers a lot of material and uses lots of different software and tools. If you want to run the examples in the book you have two options 1) a user-contributed Linux Vagrant image is available with most of the required software or 2) you can follow along the instructions given in the book and the accompanied Github project and install the software yourself. In either case you have to pay close attention to software versions. All of the examples work but it does require some effort the get them running and if you feel uncomfortable using a terminal and command line you might have a hard time playing with the examples.

Being able to work in an agile way with data science is quite important but I do not feel that the attempt made by the author convinced me that the suggested framework will work in a practical setting.
The main value of this book is definitely chapter 3 where Jurney show us how to go from zero to a working data science application. The application is literally build from ground up starting with data collection over storing data to build a web front-end. This chapter is alone worth the price of the entire book.

Part II of the books contains interesting material about data visualizations and prediction models. For many readers some prior knowledge about Naive Bayes and the Natural Language Toolkit would most likely be useful to fully understand the implications of the predictions made around what makes an email likely to receive a response.

I review for the O`Reilly Reader Review Program and I want to be transparent about my reviews so you should know that I received a free copy of this ebook in exchange of my review.
5 von 5 Kunden fanden die folgende Rezension hilfreich
HASH(0x95f7ad2c) von 5 Sternen Could be useful for data scientists who need a familiarity with deployment environments. 17. Juni 2014
Von K. Luangkesorn - Veröffentlicht auf Amazon.com
Format: Taschenbuch
One of the problems with data science is that any description of what is encountered takes on the appearance of a mythical unicorn, noone person could possibly have all of the skills required. And it gets worse when you add to the standard set of statistics, domain knowledge, and programming the ability to deploy the application into a high speed environment. This book is not going to make a data scientist an expert in running a data center, but it is useful to give someone who has the rest of the skills an understanding of the environment their work will be deployed into.

One of the conflicts between the data scientist/analyst and information technology groups is that while the data scientist gives the data owned by the organization its value, IT is charged with storing the data and providing the access. And in a high velocity, high volume environment of big data, not understanding how the architecture works can lead to the data scientist creating valid solutions that cannot be applied in the actual day to day working environment. That is where this book comes in. The book has associated virtual machines in software repository so that the data scientist who does not know anything about infrastructure and the software stack that the data and the analysis rides on can see how everything fits together.

The book title is misleading. This is not a book about data analytics. This is a book for data analysts so they know how their analytical application is deployed and applied to day-to-day use in enterprise environments. For that reason it is useful.

Disclaimer: I received a free electronic copy of this book as part of the Oreilly Press Blogger program.
1 von 1 Kunden fanden die folgende Rezension hilfreich
HASH(0x95f7af48) von 5 Sternen Frustrating but beneficial. 25. Oktober 2014
Von Kev - Veröffentlicht auf Amazon.com
Format: Taschenbuch Verifizierter Kauf
This book is terribly frustrating. I have a lot of experience in statistical analysis, machine learning, and programming and picked up this book to learn how to apply these skills with "Agile Data Science." This mostly means implementing these skills with the tools of AWS, Hadoop, and hosting data products on scalable platforms.

The book does a great job of summarizing the agility needed and the tools used, but the code to implement these tools is lacking. I expected the code in the book to be outdated even though the book was only published a year ago. There is a github repository for the code but it is incomplete.

One benefit of the code not working or the instructions being vague is having to debug it yourself or search for solutions. This is a great learning tool. I don't think this is a benefit the author would like and should put some more time in the second iteration of making the instructions clearer. Since the book has only 164 pages there is significant room for growth.
Waren diese Rezensionen hilfreich? Wir wollen von Ihnen hören.