Data Science at the Command Line: Facing the Future with Time-Tested Tools demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data. This work is licensed under the Creative Commons Attribution-NoDerivatives 4.0 International License.
Discover why the command line is an agile, scalable, and extensible technology. Even if you’re already comfortable processing data with, say, Python or R, you’ll greatly improve your data science workflow by also leveraging the power of the command line.
Topics included: Introduction • Getting Started • Obtaining Data • Creating Reusable Command-line Tools • Scrubbing Data • Managing Your Data Workflow • Exploring Data • Parallel Pipelines • Modeling Data.
Download Free PDF / Read Online
Publisher: O’Reilly Media
Published: October 2014
Format(s): Online (HTML)
File size: –
Number of pages: 212
Download / View Link(s): Read online