Professional Development

Data Wrangling and Visualization

According to a New York Times article by Steve Lohr (2014), data scientists spend 50% to 80% of their time on data cleaning and transformation, a process known as data wrangling, which leaves only 20% to 50% of their time for data modeling. This underscores the importance of data wrangling skills.

“Data scientists, according to interviews and expert estimates, spend from 50 percent to 80 percent of their time mired in this more mundane labor of collecting and preparing unruly digital data, before it can be explored for useful nuggets.” (Steve Lohr, August 17, 2014)

However, most degree programs focus on data modeling, presumably because it is the most technically challenging part and is considered worthy of a degree. Most data science programs do not teach data wrangling and visualization systematically in a dedicated course; instead, they expect students to pick up these skills while learning modeling, so students face two challenges at the same time. The same is true of most statistics classes, where students must learn programming software in addition to the statistics topics. This certificate is therefore designed to help students with little prior knowledge of R, a primary statistical analysis language used by data scientists, by giving them the necessary programming knowledge so that they can focus more on statistics and machine learning topics in their future endeavors. The course also aims to give aspiring data scientists the introductory knowledge and skills they need to get started.

Module 1: Orientation and Introduction to R and RStudio 
Module 2: Data Types and Structure in R
Module 3: Data Input and Output
Module 4: Data Manipulation with Tidyverse
Module 5: R Programming 
Module 6: Data Visualization with R and Capstone Project

  • Students who plan to enter a data science program (e.g., MS in Business Analytics) or take statistics courses at the undergraduate or graduate level.
  • Company employees who need to learn R Programming.
  • Anyone who wants to have a career in data science and business analytics.
  • Anyone who wants to learn R Programming.

Convenience with Responsibility
  • Take the program from anywhere in the world; it is delivered entirely online.
  • The offering is fully asynchronous, meaning that there is no set class time. Each module takes one week to finish, so all six modules take six weeks.
  • However, you must manage your time so that the assignment for each module is finished by the deadline set on Canvas.

Learning Objectives

  • Each module follows the Quality Matters framework, which has been shown to be effective for online learning. That is, each module starts with learning outcomes, followed by step-by-step instructions that include a one-hour video lecture, supplemental materials to reinforce the lecture, and practice assignment(s).
  • An assignment is given for each topic and graded with feedback to ensure that students can apply what they learned to a different task.

Rigorous Assessment and Verification

  • To receive a certificate of achievement, participants must earn a grade of C or better in each module and pass a capstone project assignment.
  • Watching a video alone is not sufficient to demonstrate your knowledge and skills in a topic.
  • The certificate is issued only when you demonstrate that you achieved the learning outcomes.

Dr. Jae Min Jung is a Professor of Marketing and the director of the Center for Customer Insights and Digital Marketing (CCIDM) at Cal Poly Pomona. He received a Ph.D. in Marketing from the University of Cincinnati and an MBA with a concentration in Business Statistics from the University of North Texas. Early in his career, Dr. Jung's research focused on the impact of cultural values on persuasion and decision-making issues in consumer psychology and marketing, including social influence strategies and consumers' retail interaction style. More recently, he has investigated digital marketing topics such as online advertising (e.g., interactive banners and game ads) and social media marketing. Currently, Dr. Jung is interested in applying econometrics and data science methods to consumer behavior. His research has been published in renowned journals such as European Journal of Marketing, International Journal of Hospitality Management, International Marketing Review, Journal of Business Research, Journal of Cross-Cultural Psychology, Marketing Letters, Public Relations Review, and Psychology & Marketing. In addition, Dr. Jung teaches Marketing Research, Data Mining for Marketing Decisions, and Business Analytics Project courses at both the graduate and undergraduate levels.

Course Offering:

Spring 2022

Data Wrangling and Visualization Certificate
Course Code | Session Dates | Fees | Instructor
DWV100 (SP22-1) | Feb 7, 2022 - Mar 20, 2022 | $199 | Jae Min Jung



The current course schedule, which includes class meeting times, locations, and design, is subject to change without notice or obligation. The University does not guarantee the availability of a particular course or section and reserves the right to cancel a course due to insufficient enrollment.

College of the Extended University
3801 W. Temple Ave.
Bldg 220C-140
Pomona CA 91768

Phone: 909-869-2288
Office Hours:
Monday – Friday 
8:00 AM to 5:00 PM
