About This Course

Pandas (the Python Data Analysis library) provides a powerful and comprehensive toolset for working with data, including tools for reading and writing diverse files, data cleaning and wrangling, analysis and modeling, and visualization. Fields with widespread use of Pandas include: data science, finance, neuroscience, economics, advertising, web analytics, statistics, social science, and many areas of engineering.

The Pandas Mastery Workshop is an intensive 3-day class designed for students to gain proficiency using the Python Pandas library for data analysis. With a hands-on, exercise-intensive design and individualized instructor coaching, students will leave this class with capability to immediately transfer their learnings into application including:

  • Accessing Data From Multiple Sources
  • Cleaning and Preparing Data
  • Database Access and Data Wrangling
  • Data Visualization
  • Data Analysis
  • Real-World Modeling and Problem Solving
thumbnail

Course Overview

The class progresses step-by-step through a repeatable data analysis workflow using the Python Pandas library, including: reading in data from multiple sources and databases, cleaning, merging, and munging data to prepare it for analysis, and data exploration and visualization.

This class is perfect for someone who uses or plans to use Python and Pandas regularly in their day-to-day work, and wants to achieve a high level of proficiency rapidly. Quantitative analysts, data scientists, and business analysts will find this class particularly beneficial.
Students in the Pandas Mastery Workshop develop transferable skills in using Pandas for data loading, wrangling, cleaning, visualization, manipulation, transformation, and analysis.

Contact Us

Questions or need help registering? Call us at 512.536.1057 or fill out the form:

Course Syllabus & Topics

Course Prerequisites

Knowledge of programming in the Python standard language (data structures, control flow, assignment, functions, and package access) and familiarity with array programming in NumPy is required. Knowledge of general data analysis techniques and basic statistics (mean, standard deviation, correlation, etc.) is strongly recommended.

Individuals who have taken Enthought’s Python Foundations, Python for Scientists and Engineers, or Python for Data Science classes will have the prerequisite knowledge for the course.

Collapse All
  • Reading and writing data from local files (.txt,.csv,.xls, .json, etc)
  • Reading data from remote files
  • Scraping tables from web pages (.html)
  • Making the most of the powerful read_table method

  • Working with Pandas data structures: Series and DataFrame
  • Accessing your data: indexing, slicing, fancy indexing, boolean indexing
  • Data wrangling, including dealing with dates and times and missing data
  • Adding, dropping, selecting, creating, and combining rows and columns

  • Database access with DB-API2 and SQLAlchemy
  • Executing SQL commands from Pandas
  • Loading database data into a DataFrame
  • Combining and manipulating DataFrames: merge, join, concatenate

  • Understanding the structure of a Figure
  • Data visualization: scatter plots, line plots, box plots, bar charts,and histograms with matplotlib
  • Customizing plots: important attributes and arguments

  • Split-apply-combine with DataFrames
  • Data summarization and aggregation methods
  • Pandas powerful groupby method
  • Reshaping, pivoting, and transforming your data
  • Simple and rolling statistics

  • Deep learning of the data analysis tools through lectures, Q&A, and hands-on exercises
  • Develop transferable skills through application to authentic data sets
  • Predict the future with time series analysis
  • And more!

Open Class Schedule

Open Class Schedule

The course is 3 days, 9:00 AM to 5:00 PM. Onsite corporate classes are also available. Discounts are available for 3 or more attendees and academics currently at a degree-granting institution. Contact us with the form to the right to learn more.

There are no classes scheduled at this time. To request one, please contact us using the form to the right.

Contact Us

Questions or need help registering? Call us at 512.536.1057 or fill out the form:

FAQs

What is the difficulty level of the Pandas Mastery Workshop?

Overall the Pandas Mastery Workshop would be considered an intermediate-level course, and requires previous Python experience (specifically a working knowledge of the Python standard language and familiarity with array programming in NumPy). Some exposure to Pandas specifically is useful, but not required. The course is fast-paced and covers a lot of ground, so if you consider yourself an experienced beginner in Pandas then it should be a great fit.

I am worried that your training is only useful to people who are committed to using Enthought software products. How much of your training is usable without Enthought software?

100%. Our training teaches students how to write software with Python and solve problems using its scientific packages, not how to use proprietary software. Everything you will learn uses free and open source software. We provide Enthought Canopy (our integrated analysis environment and Python distribution) to training participants to ensure they have all of the tools and Python packages they need to complete the training and that the tools are as easy as possible to install. While participants sometimes do use other editors, package managers, and Python distributions, we strongly recommend participants use Canopy during the training. With Canopy we can ensure that you can easily install everything you need for the course out of the box and we can provide technical support (which we unfortunately cannot provide for other tool sets).

I use / will be using Anaconda Python. Will I still benefit from this course?

Absolutely. Our training materials work with any Python distribution (such as Anaconda), as long as you also have all of the necessary packages, a text or code editor, package manager, interactive IPython shell, and Jupyter notebooks installed.

Is a class completion certificate provided?

Yes, a class completion certificate is provided.

Have a question that isn’t answered here? Contact us or call 512.536.1057.

Testimonials

Terrific course! Perfect foundation to train my entire team of data scientists. We now have a common language and common set of tools for our daily research. I'm looking forward to seeing the full impact of this workshop over the next several months.

Carrie M.

The depth and breadth of this course will provide the foundation to efficient data manipulation and encapsulation.

Pandas Mastery Class Attendee

At first, I thought the fundamental work was not needed as I have had some decent experience with pandas but I am glad as some of the subtleties help me understand more about how pandas works.

Pandas Mastery Class Attendee

The finger exercises and the projects were both very helpful in solidifying the concepts.

Pandas Mastery Class Attendee

I felt the example working through real data such as the climate examples where extremely useful. Also the ability to ask questions and have side discussions on more advanced topics was very helpful.

Pandas Mastery Class Attendee

Overall, I thought it was a great class! I learned a lot of great tricks, and also solidified my understanding of the essentials. Thanks!

Pandas Mastery Class Attendee

After finishing Enthought's 'Python for Data Science' and 'Pandas Mastery Workshop' course series, I feel confident and prepared to tackle even the ugliest datasets around. Their teachers are very knowledgeable and do a great job explaining tricky topics with ease and clarity. I highly recommend their training to anyone whose workflow revolves around data.

William C.

Great class to scale up in Pandas, great instructor and excellent course-materials. Definitely recommendable.

Pandas Mastery Class Attendee