Learn Practical Data Sciences with Bash Shell
This project-based interactive course will help you to become an expert in bash and learn to explore real-world large data sets. Revised and updated with 2021 changes.
- 142 Enrolled
- Beginner
- Created by Scientific Programming Team
- Last updated Mon, 07-Jun-2021
INTRODUCTION
CLASSROOM RESOURCES
- Join Live! No classes scheduled yet!
- PDF E-Book
-
HANDS-ON BASH TUTORIALS
- Which Bash Shell?
- Bash variables and functions
- Bash meta characters
- Bash quotation
- Read and store user input with Bash (Preview)
- Bash conditional statements
- Bash looping statements (Preview)
- Bash arithmetic operations
- Bash arrays
- Practice with interactive shell (Preview)
REGULAR EXPRESSIONS AND BASH SHELL
- Introduction to Regular Expressions (RegEx)
- Basic Regular Expressions
- Extended Regular Expressions
- RegEx character classes
- RegEx atomic groups
- RegEx in Bash shell scripting
AWK HANDS-ON
- Learn AWK
- AWK Built-in variables and functions
- Practice AWK (Preview)
STREAM EDITOR (SED)
- SED - Stream Editor (Preview)
- SED Substitution
- SED and regular expressions
- SED delete, print and grouping
GREP HANDS-ON
- What is GREP?
- GREP Hands-on examples
- GREP and regular expressions
- GREP and Find
PROJECT 1: UNIVERSITY RANKING
- Data Preview
- Find the colleges in the ranklist (grep, pipe and wc) (Preview)
- Finding the percent of colleges in the ranklist
- Listing the Institutes from a given state
- Count all institutes from each state
- Project decision: University tuition vs ranks (Preview)
- Project-1: Video demo (Preview)
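The Project 1 steps listed above (counting with grep, pipe and wc, computing a percentage, and counting institutes per state) might look roughly like the sketch below. The file layout (name,state,rank with an empty rank for unranked colleges), the sample rows and the function names are all assumptions made for illustration; they are not the course's actual data set or solution.

```bash
#!/usr/bin/env bash
# Hypothetical sample data: name,state,rank (empty rank = not in the ranklist).
data=$(mktemp)
cat > "$data" <<'EOF'
name,state,rank
Alpha College,Texas,12
Beta Institute,Ohio,
Gamma University,Texas,47
Delta College,Ohio,88
Epsilon Institute,Utah,
EOF

# Count ranked colleges: skip the header, keep rows that end in a number.
count_ranked() {
  tail -n +2 "$1" | grep -E ',[0-9]+$' | wc -l | tr -d ' '
}

# Percent of ranked colleges among all data rows (integer arithmetic).
percent_ranked() {
  local total ranked
  total=$(tail -n +2 "$1" | wc -l | tr -d ' ')
  ranked=$(count_ranked "$1")
  echo $(( 100 * ranked / total ))
}

# Institutes per state, highest count first, ties in alphabetical order.
# The final awk step assumes single-word state names, as in the sample.
count_by_state() {
  tail -n +2 "$1" | cut -d, -f2 | sort | uniq -c |
    sort -k1,1nr -k2,2 | awk '{print $2, $1}'
}

count_ranked "$data"
percent_ranked "$data"
count_by_state "$data"
```

On the sample rows, three of the five colleges carry a rank, so the first two calls print 3 and 60. A real data set would likely need a more careful field match than "line ends in a number".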
PROJECT 2: FACEBOOK DATAMINING
- Facebook data
- Data preview
- Finding the number of status entries
- Find the most popular status entry (Preview)
- Project decision: Find the most popular entry
- Project-2: Video demo
PROJECT 3: CRIME STAT
- Data Introduction
- Data preview
- Find the top crime name
- Finding the top crime per city
- Project decision: The best city with the least crime (scripting)
- Project-3: Video demo
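As a rough illustration of the "top crime per city" step above, the sketch below counts each city and crime pair with an AWK associative array and keeps the largest count per city. The two-column layout (city,crime), the sample rows and the function name are hypothetical and chosen only for the example; ties between crimes are not handled.

```bash
#!/usr/bin/env bash
# Hypothetical sample data: one reported crime per row.
data=$(mktemp)
cat > "$data" <<'EOF'
city,crime
Adelaide,theft
Adelaide,theft
Adelaide,assault
Darwin,assault
Darwin,assault
Darwin,theft
Perth,fraud
EOF

# Print "city crime count" for the most frequent crime in each city.
top_crime_per_city() {
  awk -F, '
    NR > 1 { n[$1 FS $2]++ }              # count every city,crime pair
    END {
      for (k in n) {
        split(k, p, FS)                   # p[1] = city, p[2] = crime
        if (n[k] > best[p[1]]) {
          best[p[1]] = n[k]
          name[p[1]] = p[2]
        }
      }
      for (c in name) print c, name[c], best[c]
    }' "$1" | sort
}

top_crime_per_city "$data"
```

The trailing sort is there only because AWK does not guarantee any iteration order over array keys.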
PROJECT 4: TEXT-MINING
- Data Introduction
- Data preview
- Counting the Plays and Poems
- Statistics for each author's works
- Project decision: Dig out the frequent words used by Shakespeare
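A word-frequency count like the one named in the Project 4 decision is commonly built from tr, sort and uniq. The sketch below is one minimal way to do it, not necessarily the approach the course takes; the function name top_words and the two-line sample text are assumptions for illustration.

```bash
#!/usr/bin/env bash
# Print the N most frequent words of a text file as "count word" lines.
top_words() {
  local file=$1 n=${2:-10}
  tr -cs '[:alpha:]' '\n' < "$file" |   # one word per line, split on non-letters
    tr '[:upper:]' '[:lower:]' |        # fold case so "To" and "to" match
    grep -v '^$' |                      # drop any empty line
    sort | uniq -c |                    # count identical words
    sort -k1,1nr -k2,2 |                # highest count first, then alphabetical
    head -n "$n" |
    awk '{print $1, $2}'                # normalise the padding added by uniq -c
}

# Hypothetical two-line sample text.
sample=$(mktemp)
printf '%s\n' \
  'To be, or not to be, that is the question:' \
  'Whether tis nobler in the mind to suffer' > "$sample"

top_words "$sample" 3
```

On the sample text the most frequent word is "to" with three occurrences. For a large corpus, filtering out common stop words before counting would probably make the result more informative.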
BIG DATA CONCEPTS
- Big Data and file formats
- Hadoop Distributed File System
- Map Reduce
- YARN
- Flume
- Spark
CONCLUSIONS
- Conclusions
- References
Description
This is a completely revised and updated 2020 version of the 'Learn Data with Bash Shell' course.
Bash may not be the best way to handle all kinds of data, but there often comes a time when you are given a pure Bash environment, such as on common Linux-based supercomputers, and you just want an early result or a first view of the data before diving into real programming with Python, R, SQL, SPSS, and so on. Expertise in data-intensive languages comes at the price of spending a lot of time on them. In contrast, Bash scripting is simple, easy to learn and well suited to mining textual data. Therefore, learning the Bash shell should be the first step if you want to say hello to “Big Data”!
This interactive course demonstrates four practical flat-file data mining projects, each with a different objective:
- University ranking data,
- Facebook data,
- Australian crime statistics data and
- Shakespeare-era plays and poems data (Big data!)
If you haven’t used Bash before, feel free to skip the projects and go straight to the tutorials. Complete the tutorials first and then come back to the projects. The tutorial section introduces Bash scripting, regular expressions, AWK, sed, grep and so on. The course finishes with a near-complete list of references to all the relevant command-line and Big Data tools.
Target audience and prerequisites: Almost everyone can benefit from learning to use Bash for data mining, particularly students who want to learn Bash and the command line to improve their career prospects, researchers who want to add Bash and other command-line tools to their bag of tricks, and scientists who want to learn to explore and analyze the data that their lab generates.
Requirements
- It is a self-learning course with all Linux environments provided.
Outcome
- Use Bash to quickly sort, search, match, replace, clean and optimise various aspects of a data set
- Use Bash in processing real-world data sets (included)
- Use Bash commands and scripting
- Use Regular Expressions (RegEx) in Bash
- Use AWK programming language commands to tweak and format data
- Use SED and GREP to quickly search in large-scale data sets
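To make the outcomes above a little more concrete, here is a small sketch that cleans a messy file with sed, formats it with AWK and searches it with grep. The input format (name and number separated by a semicolon), the sample values and the function names clean, report and find_name are invented for illustration and are not taken from the course material.

```bash
#!/usr/bin/env bash
# Hypothetical messy input: stray blanks, a blank line, ";" as separator.
raw=$(mktemp)
printf '%s\n' '  alice ; 42 ' 'bob;17' '' ' carol ;  8' > "$raw"

# Clean: trim both ends, drop empty lines, turn " ; " into ",".
clean() {
  sed -E -e 's/^[[:space:]]+//' -e 's/[[:space:]]+$//' -e '/^$/d' \
         -e 's/[[:space:]]*;[[:space:]]*/,/g' "$1"
}

# Format: aligned two-column report plus a total, using AWK.
report() {
  clean "$1" |
    awk -F, '{ printf "%-8s %3d\n", $1, $2; sum += $2 }
             END { print "total", sum }'
}

# Search: case-insensitive match on the cleaned lines.
find_name() {
  clean "$1" | grep -i -- "$2"
}

clean "$raw"
report "$raw"
find_name "$raw" BOB
```

Keeping the cleaning step in its own function means the report and the search both work on the same normalised lines.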
Instructor Profile
The Scientific Programming Team helps you learn to use scientific programming languages and tools, such as CUDA, Julia, OpenMP, MPI, C++, Matlab, Octave, Bash, Python, Sed and AWK, including RegEx, for processing scientific and real-world data. The team is made up of PhD-educated instructors in the areas of Computational Sciences.
Student feedback
5 -
$5 $25
InteractiveShell.com
By Scientific Programming School
Reviews (1)
An excellent introductory course to Bash and Data Sciences @ command line. Must have for all beginners to advanced users!