Learn to Analyze Text Data in Bash Shell and Linux - PDF by Learn Scientific Programming

Learn to Analyze Text Data in Bash Shell and Linux - PDF

A step by step guide to learn bash shell using data projects by Ahmed Arefin. PhD

Get access for $7 $3.50
Thumb bash book 6 1200x874


This book will help you to explore real-world large data sets with the help of Bash commands. As seen on the: 

There are several examples of practical data mining that will have a flow of importing specific data resources into flat text-type files. Bash can run different programs (grep, sort, sed, and so on) on those files, clean, optimize and extract preliminary views (cut, csvlook, view, cat, head, etc.) of the data.  A scripting language like Bash can be very useful for doing the transformation.

This course starts with some practical bash-based flat file data mining projects involving: 
  • University ranking data
  • Facebook data
  • Crime Data
  • Shakespeare-era plays and poems data 
(all data collected from the public domain and included)

Bonus contents:
  • Data sets
  • Source codes

Don't forget to get the video animated version of this book too!

What's included?

File Icon 3 files


3.66 MB
Supporting Files
3.39 MB
3.37 MB


Who is this book for?

This book is suitable for anyone who wants to deal with some kinds of data! such as students or researchers who want to add Bash and other command line tools to their bag of tricks, scientists who want to learn to explore and analyze the data that their lab generates, or even journalists who want to polish their reporting by analyzing publicly-available data sets.

What if I haven't used Bash Shell before?

If you haven’t used Bash before, feel free to skip the projects and get to the tutorials part. Read the tutorials and then come back to the projects again. The tutorial section will introduce with bash scripting, regular expressions, AWK, sed, grep and so on.

Why should I use Bash shell scripting for data mining?

Bash may not be the best way to handle all kinds of data, but there often comes a time when you are provided with a pure Bash environment, such as what we get in the common Linux-based Supercomputers and you just want an early result or view of the data before driving into the real programming, using Python, R and SQL, SPSS, and so on. Expertise in data-intensive languages comes at the price of spending a lot of time on them. In contrast, bash scripting is simple, easy to learn and perfect for mining textual data.

How this book is different from any other Bash/Linux manuals?

You wouldn't not find many beginner's books that discuss Bash in the context of data sciences! I wanted to create a super beginner friendly reading material that would help the people who are not very much familiar with Bash/Linux, but willing to use the power of it. I hope you will enjoy it!

What is your return policy?

If you are completely not satisfied, you can get a refund within 7 days of purchase.