About This Course

Pandas (the Python Data Analysis library) provides a powerful and comprehensive toolset for working with data, including tools for reading and writing diverse files, data cleaning and wrangling, analysis and modeling, and visualization. Fields with widespread use of Pandas include: data science, finance, neuroscience, economics, advertising, web analytics, statistics, social science, and many areas of engineering.

The Pandas Mastery Workshop is an intensive 3-day class designed for students to gain proficiency using the Python Pandas library for data analysis. With a hands-on, exercise-intensive design and individualized instructor coaching, students will leave this class with capability to immediately transfer their learnings into application including:

  • Accessing Data From Multiple Sources
  • Cleaning and Preparing Data
  • Database Access and Data Wrangling
  • Data Visualization
  • Data Analysis
  • Real-World Modeling and Problem Solving

 

Course Overview

The class progresses step-by-step through a repeatable data analysis workflow using the Python Pandas library, including: reading in data from multiple sources and databases, cleaning, merging, and munging data to prepare it for analysis, and data exploration and visualization.

This class is perfect for someone who uses or plans to use Python and Pandas regularly in their day-to-day work, and wants to achieve a high level of proficiency rapidly. Quantitative analysts, data scientists, and business analysts will find this class particularly beneficial.

Students in the Pandas Mastery Workshop develop transferable skills in using Pandas for data loading, wrangling, cleaning, visualization, manipulation, transformation, and analysis.

Contact Us

Questions or need help registering? Call us 512.536.1057 or fill out the form:

Course Instructors

Enthought instructors have doctorates in scientific fields such as physics, engineering, computer science, and mathematics, and all have extensive experience through research and consulting in applying Python to solve complex problems across a range of industries, allowing them to bring their real world experience to the classroom every day. Enthought instructors possess professional, first-hand experience with the tools and technologies covered in our courses.

Testimonials

Course Syllabus & Topics

Course Prerequisites

Knowledge of programming in the Python standard language (data structures, control flow, assignment, functions, and package access) and familiarity with array programming in NumPy is required. Knowledge of general data analysis techniques and basic statistics (mean, standard deviation, correlation, etc.) is strongly recommended.

Individuals who have taken Enthought’s Python Foundations, Python for Scientists and Engineers, or Python for Data Science classes will have the prerequisite knowledge for the course.

Accessing Data From Multiple Sources

  • Reading and writing data from local files (.txt,.csv,.xls, .json, etc)
  • Reading data from remote files
  • Scraping tables from web pages (.html)
  • Making the most of the powerful read_table method

Cleaning and Preparing Data

  • Working with Pandas data structures: Series and DataFrame
  • Accessing your data: indexing, slicing, fancy indexing, boolean indexing
  • Data wrangling, including dealing with dates and times and missing data
  • Adding, dropping, selecting, creating, and combining rows and columns

Database Access and Data Wrangling

  • Database access with DB-API2 and SQLAlchemy
  • Executing SQL commands from Pandas
  • Loading database data into a DataFrame
  • Combining and manipulating DataFrames: merge, join, concatenate

Data Visualization

  • Understanding the structure of a Figure
  • Data visualization: scatter plots, line plots, box plots, bar charts,and histograms with matplotlib
  • Customizing plots: important attributes and arguments

Data Analysis

  • Split-apply-combine with DataFrames
  • Data summarization and aggregation methods
  • Pandas powerful groupby method
  • Reshaping, pivoting, and transforming your data
  • Simple and rolling statistics

Real-World Modeling and Problem Solving

  • Deep learning of the data analysis tools through lectures, Q&A, and hands-on exercises
  • Develop transferable skills through application to authentic data sets
  • Predict the future with time series analysis
  • And more!

Open Class Schedule

The course is 3 days, 9:00 AM to 5:00 PM. Onsite corporate classes are also available. Discounts are available for 3 or more attendees and academics currently at a degree-granting institution. Contact us to learn more.

WhereWhenPrice (per person)Reserve a Seat
Contact us with the form to the right to request an onsite class or a class in your area!

Contact Us

Questions or ready to register? Call us 512.536.1057 or fill out the form below:

FAQs

What is the difficulty level of the Pandas Mastery Workshop?

Overall the Pandas Mastery Workshop would be considered an intermediate-level course, and requires previous Python experience (specifically a working knowledge of the Python standard language and familiarity with array programming in NumPy). Some exposure to Pandas specifically is useful, but not required. The course is fast-paced and covers a lot of ground, so if you consider yourself an experienced beginner in Pandas then it should be a great fit.

Is a class completion certificate provided?

Yes, a class completion certificate is provided for the Pandas Mastery Workshop.

Have a question that isn’t answered here? Contct us or call 512.536.1057.

Testimonials

After finishing Enthought's 'Python for Data Science' and 'Pandas Mastery Workshop' course series, I feel confident and prepared to tackle even the ugliest datasets around. Their teachers are very knowledgeable and do a great job explaining tricky topics with ease and clarity. I highly (...)

Business Analyst, Information Technology, Xceed Group Consulting

Terrific course! Perfect foundation to train my entire team of data scientists. We now have a common language and common set of tools for our daily research. I'm looking forward to seeing the full impact of this workshop over the next several months.

Research Leader, Drug Discovery, Schrodinger

I felt the example working through real data such as the climate examples where extremely useful. Also the ability to ask questions and have side discussions on more advanced topics was very helpful.

Great class to scale up in Pandas, great instructor and excellent course-materials. Definitely recommendable.

Overall, I thought it was a great class! I learned a lot of great tricks, and also solidified my understanding of the essentials. Thanks!

The finger exercises and the projects were both very helpful in solidifying the concepts.

The depth and breadth of this course will provide the foundation to efficient data manipulation and encapsulation.

At first, I thought the fundamental work was not needed as I have had some decent experience with pandas but I am glad as some of the subtleties help me understand more about how pandas works.