About This Course

This fast-paced class is intended for practicing data scientists, data analysts, and business intelligence experts interested in using Python for their day-to-day work. The primary focus is on learning to use Python tools for data science, data analysis, and machine learning efficiently and effectively.

Course Overview

Participants in this course will take away:

  • Hands-on experience setting up an integrated analysis environment for doing data science with Python.
  • An understanding of how to use the Python standard library to write programs, access the various data science tools, and document and automate analytic processes.
  • Orientation to some of the most powerful and popular Python libraries for data science including Pandas (data preparation, analysis, and modeling; time series analysis), scikit-learn (machine learning), Matplotlib and Seaborn (data visualization).
  • Working knowledge of the Python tools ideally suited for data science tasks, including:
    • Accessing data (e.g., text files, databases)
    • Cleansing and normalizing data
    • Exploring data (e.g., simple statistics, correlation matrices, visualization)
    • Modeling data (e.g., machine learning)

Contact Us

Questions or need help registering? Call us at 512.536.1057 or fill out the form:

Course Instructors

Enthought instructors have doctorates in scientific fields such as physics, engineering, computer science, and mathematics, and all have extensive experience through research and consulting in applying Python to solve complex problems across a range of industries, allowing them to bring their real world experience to the classroom every day. Enthought instructors possess professional, first-hand experience with the tools and technologies covered in our courses.

Testimonials

Course Syllabus & Topics

Course Prerequisites

The course assumes a working knowledge of key data science topics (statistics, machine learning, and general data analytic methods). Programming experience in some language (such as R, MATLAB, SAS, Mathematica, Java, C, C++, VB, or FORTRAN) is expected. In particular, participants need to be comfortable with general programming concepts like variables, loops, and functions. Experience with Python is helpful (but not required).

I. Introduction and Setting Up Your Integrated Analysis Environment

Setting Up Your Integrated Analysis Environment & Tools Overview

  • IPython Shell
  • Custom environment settings
  • Jupyter Notebooks
  • Script editor
  • Packages: NumPy, SciPy, scikit-learn, Pandas, Matplotlib, Seaborn, etc.

Once you complete this module, you will understand some of the unique benefits of using Python for data science / what features make Python particularly well-suited for data science, you will be able to set up a fully functioning Python-based analysis environment, and you will know what each tool is used for in the data science workflow.

II. Using Python to Control and Document Your Data Science Processes

Python Essentials

  • Data types and objects
  • Loading packages, namespaces
  • Reading and writing data
  • Simple plotting
  • Control flow
  • Debugging
  • Code profiling

Once you complete this module, you will be able to use the Python standard library plus Canopy tools to write, run, debug, and profile programs that control your data science processes (which draw on the scientific packages).

III. Accessing and Preparing Data

Acquiring Data with Python

  • Loading from CSV files
  • Accessing SQL databases

Cleansing Data with Python
  • Stripping out extraneous information
  • Normalizing data
  • Formatting data

Once you complete this module, you will know how to load data from common types of data sources, including structured text files and SQL databases. and you will know some of the common tools used in Python to cleanse and prepare your data for analysis.

IV. Numerical Analysis, Data Exploration, and Data Visualization with NumPy Arrays, Matplotlib, and Seaborn

NumPy Essentials

  • The NumPy array
  • N-dimensional array operations and manipulations
  • Memory mapped files

Data Visualization

  • 2D plotting with Matplotlib
  • Advanced data visualization with Seaborn

Once you complete this module, you will understand how to use NumPy arrays for efficient numerical processing and how to use NumPy methods such as slicing to write code that is both compact and easy to read and understand. You will know how to use Matplotlib, Seaborn, and NumPy together to explore and visualize your data.

V. Exploring Data with Pandas

Searching for Gold in a Pile of Pyrite

  • Data manipulation with Pandas
  • Statistical analysis with Pandas
  • Time series analysis with Pandas

At the end of this module, you will know how to access some of the core tools used for statistical analysis and data exploration in Python.

VI. Machine Learning with scikit-learn

Predicting the Future Can Be Good for Business

  • Input: 2D, samples, and features
  • Estimator, predictor, transformer interfaces
  • Pre-processing data
  • Regression
  • Classification
  • Model selection

At the end of this module you will have a working understanding of what machine learning tools are available in scikit-learn and how to use them.

Open Class Schedule

Onsite corporate classes are also available. Discounts are available for 3 or more attendees and academics currently at a degree-granting institution. Contact us to learn more.

WhereWhenPrice (per person)Register
Houston, TXOctober 2-6, 2017$2750Contact us with the form to the right
Albuquerque, NMOctober 16-20, 2017$2750
Washington, DCNovember 13-17, 2017$2750Contact us with the form to the right
London, UKNovember 20-24, 2017£2260Contact us with the form to the right
New York City, NYDecember 4-8, 2017$2750Contact us with the form to the right

Contact Us

Questions or need help registering? Call us 512.536.1057 or fill out the form below:

FAQs

  • Is a class completion certificate provided?
    • Yes, a class completion certificate is provided for the Python for Data Science class.

Have a question that isn’t answered here? Contact us or call 512.536.1057.

Testimonials

I did not think that I would learn Python programming in one week given that I do not have strong background in programming, but with Mr. Dickinson and the whole course I am ready to implement my reports to Python and effectively enhance their efficiency and quality.

Senior Manager - Investment Management, ADIA

This course was extremely helpful in understanding how to use IPython and pandas for doing data analysis and visualization. The instructor was very knowledgeable and when he did not know the answer to a question, he would always find the answer and get back to us with the information.

Senior Performance Engineer, Gogo LLC

Highly recommended if you want to learn or improve your Python. The trainer, Dr. Dickinson, was the best trainer I have ever encountered. He had a nice style of presenting and was very intelligent and knowledgeable. Easily answered even the most complex questions threw at him.

Software Engineer, Financial Services

This is a great class for anybody, new to Data Science or experienced. The instructor Dr Dickinson. is a Math/Python genius so he will answer all questions and guide you through all the Python tricks and traps/gotchas. All questions are answered. Overall I left the course with enough (...)

Data Analyst, Banking

As a novice to data science this course provided an excellent introduction to the tools and concepts available through Python for performing both rudimentary and complex analysis.

Senior Video Engineer, Time Warner Cable

The course and program were great. Right pace, right breadth of material to cover. Also, the instructor was very knowledgable and really did a good job of walking the fine line between providing enough support/guidance to us and letting us get ourselves into trouble before throwing us the life raft.

Analyst, International Development

Pietro is fantastic, and the course was well designed. It's clear that a lot of thought and prep was put into it.

The pace of the class was what attracted me. It's hard to find a class that assumes that you know how to code, but that you don't know Python. I was looking for something to (...)

Analyst, Data Science, USAID

I've taken two Enthought Python classes now. They are very effective and cover wide ranges of topics. If you are really committed and review materials you learned once more at night, the material you learned actually become very practical for your day-to-day work. Highly recommended.

Engineer, Nuclear Waste Management, Sandia

Really enjoyed the class. I had a varied level of knowledge of the different things that were covered. Those I knew well, I had holes filled in. Those I didn't know well, I learned quickly.

Data Scientist, Facebook

As a Python beginner without a computer programming background, I found that after this 5 day course I was able to work effectively in Python. This class was incredibly helpful in providing a foundation in a variety of areas for Data Science. Enthought not only has a great product but (...)

Data Analyst/Graduate Student

Jonathan Rocher is an exceptional teacher (having spent fourteen years in higher education, I have some basis for comparison). Teachers who can explain complex ideas with as much clarity as he does aren't too common; teachers who are so closely attuned to when their students are following and (...)

Data Scientist

Excellently taught course. Coming from a background of R and SAS, I finally understand Python beyond the basics and have necessary tools to really harness its power. This wouldn't have been possible without the instructor's deep knowledge, and the patience and willingness to share it!

Big Pharma

Learning from a professor who knows so much about both the internals of Python as a developer himself and about math and numbers was exceptional. Instructors knowledge and passion for the material was exceptional.

Sandia National Laboratories

This is a great overview of useful concepts that are available for data analysis and covering a broad base of tools and machine learning packages.

Sandia National Laboratories

Excellent, important, succinct way to build a knowledge base in Python, critical to helping me develop the skill to successfully implement my ideas in new ways. and the skill to do so successfully.

Marketing Analyst, Retail, Best Buy

A very insightful course which delivered by a true expert. I have left the course with hundreds of ideas upon which I can now act.

Research Manager, Oil Industry, Shell

You could tell from the demos, examples and exercises that this course was designed and taught by someone who has first hand experience of using the tools on real world and real life data.

Business Technologist, Glasgow Caledonian University

Working in data science for several years (but having no experience with Python), this course has made the transition to Python seamless and easy. The course has all the content you need to get ramped up on using Python for any existing analytics work you may have.

Revenue Management

The format -- going back and forth between lecture, finger exercises, and longer exercises -- was extremely helpful to get foundational understanding. It was a great way to actually walk away knowing how to use various libraries, as well as actually do it without cut/paste.

This course is a very good starting point to discover python and see how it could be used in real life to analyze data. The numerous examples and exercises are really useful for understanding all the concepts.

Senior Specialist - Developer

Learning the simplicity of machine learning techniques in python with sci-kit learn. The ease and ability to use pandas to slice and analyze data quickly as well as the overall structure of the python language.

The best part of the class was John Tyree. He was extremely knowledgeable about the mathematical techniques and scientific considerations of using the software rather than just the software itself. His advice and insight was very helpful.

The Lecturer, Dr. Dickinson, was excellent - His demonstration of his practical and historical knowledge of the python language combined with mathematics was invaluable. He provided tools and a road map on how to go about working with the language

Database Adminstrator and Data Analyst, HFT

The course instructor was a Python core developer so knew the language inside out. He was able to answer detailed questions we had on the mechanics of Python, or at least know exactly where we could find the right information.

Research Associate, Earth/Environmental Science

The class was delivered effectively,whilst achieving the right balance between demo and exercises. The instuctor is very good and shares his knowledge at a good pace.

Interim Modelling Manager, British Gas

Everyone in the class is very impressed by our instructor. He knows Python inside out. He encourages questions and answer our questions thoroughly. He gave us a lot to think about, such as, things going on behind the Python syntax, pointers on how to write fast Python. He really made this (...)

Data Scientist, Zurich Insurance

The instructor demonstrated excellent domain knowledge with an effective combination of demonstrations, lecture slides, and exercises

Senior Data Scientist, Technology Industry, Cobalt Light Systems

I found this course to be excellent for quickly getting me immersed in leveraging Python for Data Science and Text Analytics.

IT Specialist, SEC

Dr. Rocher is an excellent instructor. Passionate, engaging and very experienced. He is very approachable and enjoys taking the time to discuss and assist outside of class.

Data Analyst

Dr. Dickinson is very articulate in his approach towards this course often explaining in-depth the topics well beyond the documented slides.

I have an undergraduate and masters degree in statistics, but Dr. Berke's explanation of Principal Component analysis is the best I've ever heard!

This is probably the most concise Data Science primer course you can find; 100% worth it.

I particularly liked the overview of Pandas, Visualisation and SciKit Learn (days 3-5). However, Corran was highly competent at explaining all topics which made the whole course highly useful and excellent!

Data Scientist/Engineer

John Tyree's accessability/approachablity, his scientific literacy & ability to translate challenging concepts into "plain English."

Developer/Scientist, National Research Laboratory

Great course to assess the usefulness of Python for data science. Gives you a lot of pointers and saves you time. There are things you can not learn so quickly on your own: data science for Python is one of them. Learned a lot.

Mark is an excellent instructor who conveyed his enthusiasm for the subject and dealt adeptly with the various levels of experience in the class.

I was very impressed with the instructor. Great mastery of the subject matter and a good communicator. He was very enthusiastic about the subject, and cared that we were 'getting it.'

Energy Utitlies

Enthought provided really high-quality training on data science aspects of Python. I came out of the training with fluency in my ability to run regression models using various algorithms.

Marketing Analytics, Consumer Products, LifeLock

Alex was very clear and concise, as well as extremely knowledgable on the topics covered The course structure was ideal for learning a language

Excellent Data Science Course for Beginners and Practitioners!

UCSF

Dr. Rocher is a great teacher, who can explain complex topics (especially stats) in a very simple manner.

Excellent class, best introduction into Python and Data Analysis that I have seen.

Jonathan and Alex are extremely helpful and understand python very well! Thank you!

Great course, python is demystified in less than a week. I'm ready to apply it to my projects.

Data Scientist

High marks for you, Mike. We were all impressed by your teaching style and knowledge of python.

Zurich