Back to Courses

Data Analysis Courses - Page 26

Showing results 251-260 of 998
The Total Data Quality Framework
By the end of this first course in the Total Data Quality specialization, learners will be able to: 1. Identify the essential differences between designed and gathered data and summarize the key dimensions of the Total Data Quality (TDQ) Framework; 2. Define the three measurement dimensions of the Total Data Quality framework, and describe potential threats to data quality along each of these dimensions for both gathered and designed data; 3. Define the three representation dimensions of the Total Data Quality framework, and describe potential threats to data quality along each of these dimensions for both gathered and designed data; and 4. Describe why data analysis defines an important dimension of the Total Data Quality framework, and summarize potential threats to the overall quality of an analysis plan for designed and/or gathered data. This specialization as a whole aims to explore the Total Data Quality framework in depth and provide learners with more information about the detailed evaluation of total data quality that needs to happen prior to data analysis. The goal is for learners to incorporate evaluations of data quality into their process as a critical component for all projects. We sincerely hope to disseminate knowledge about total data quality to all learners, such as data scientists and quantitative analysts, who have not had sufficient training in the initial steps of the data science process that focus on data collection and evaluation of data quality. We feel that extensive knowledge of data science techniques and statistical analysis procedures will not help a quantitative research study if the data collected/gathered are not of sufficiently high quality. This specialization will focus on the essential first steps in any type of scientific investigation using data: either generating or gathering data, understanding where the data come from, evaluating the quality of the data, and taking steps to maximize the quality of the data prior to performing any kind of statistical analysis or applying data science techniques to answer research questions. Given this focus, there will be little material on the analysis of data, which is covered in myriad existing Coursera specializations. The primary focus of this specialization will be on understanding and maximizing data quality prior to analysis.
Tools for Data Science
In order to be successful in Data Science, you need to be skilled with using tools that Data Science professionals employ as part of their jobs. This course teaches you about the popular tools in Data Science and how to use them. You will become familiar with the Data Scientist’s tool kit which includes: Libraries & Packages, Data Sets, Machine Learning Models, Kernels, as well as the various Open source, commercial, Big Data and Cloud-based tools. Work with Jupyter Notebooks, JupyterLab, RStudio IDE, Git, GitHub, and Watson Studio. You will understand what each tool is used for, what programming languages they can execute, their features and limitations. This course gives plenty of hands-on experience in order to develop skills for working with these Data Science Tools. With the tools hosted in the cloud on Skills Network Labs, you will be able to test each tool and follow instructions to run simple code in Python, R, or Scala. Towards the end the course, you will create a final project with a Jupyter Notebook. You will demonstrate your proficiency preparing a notebook, writing Markdown, and sharing your work with your peers.
Data Analysis and Visualization
By the end of this course, learners are provided a high-level overview of data analysis and visualization tools, and are prepared to discuss best practices and develop an ensuing action plan that addresses key discoveries. It begins with common hurdles that obstruct adoption of a data-driven culture before introducing data analysis tools (R software, Minitab, MATLAB, and Python). Deeper examination is spent on statistical process control (SPC), which is a method for studying variation over time. The course also addresses do’s and don’ts of presenting data visually, visualization software (Tableau, Excel, Power BI), and creating a data story. Material features online lectures, videos, demos, project work, readings and discussions. This course is ideal for individuals keen on developing a data-driven mindset that derives powerful insights useful for improving a company’s bottom line. It is helpful if learners have some familiarity with reading reports, gathering and using data, and interpreting visualizations. It is the second course in the Data-Driven Decision Making (DDDM) specialization. To learn more about the specialization, check out a video overview at https://www.youtube.com/watch?v=Oi4mmeSWcVc&list=PLQvThJe-IglyYljMrdqwfsDzk56ncfoLx&index=11.
D3Js Basics
In this 1.5-hour long project-based course I will show you the basic concepts to create data visualizations in D3.js. You will learn how to use SVGs, select, and bind data in order to create a  bar chart. We will be visualizing firecracker injuries in 2019. This data is from the U.S. Consumer Product Safety Commission (CPSC) Fireworks Annual Report. Inspired by recent events this summer we are going to visualize firework injuries in the United States.  Firework injury data can be found at U.S. Consumer Product Safety Commission (CPSC) .
Analytical Solutions to Common Healthcare Problems
In this course, we’re going to go over analytical solutions to common healthcare problems. I will review these business problems and you’ll build out various data structures to organize your data. We’ll then explore ways to group data and categorize medical codes into analytical categories. You will then be able to extract, transform, and load data into data structures required for solving medical problems and be able to also harmonize data from multiple sources. Finally, you will create a data dictionary to communicate the source and value of data. Creating these artifacts of data processes is a key skill when working with healthcare data.
Manipulating Data with SQL
In this course you will learn to write the SQL code to manipulate the data in a relational database table. You’ll begin by populating the table with data. Since a database and its tables are designed and built to be repositories of data, getting the data into the tables is a critical activity in the building of a working database. When building a new home, the real test of your design comes when the furniture and family move in. It’s much the same with designing and building database tables—the real test comes when you load data into the tables and begin to use it. As you work through and complete hands-on tasks, you’ll become familiar with SQLiteStudio, the database management system used in the course. You’ll experience first-hand the impact data types and constraints have on manipulating table data. For example, as you enter new data into a table, you’ll appreciate the extra protection provided by the primary key constraint. It will not let you insert two rows into a table that are exactly alike. In addition to adding data to the tables, you’ll write the SQL code used to modify existing data values and to delete rows of data. Managing and manipulating data are SQL’s primary purposes, and SQL coding will be a powerful addition to your tool set. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Introduction to Python Functions
How many times have you decided to learn a programming language but got stuck somewhere along the way, grew frustrated, and gave up? This specialization is designed for learners who have little or no programming experience but want to use Python as a tool to play with data. In the second course, Introduction to Python Functions, you are going to learn and use functions predefined in Python and Python packages, you also are able to define functions as well. You will create and use functions to make your programs reusable and adaptive. Are you ready? Let's go! Logo image courtesy of Mourizal Zativa. Available on Unsplash here: https://unsplash.com/photos/gNMVpAPe3PE
Data calculations in Microsoft Excel
By the end of this project, you will create a free account on Microsoft 365, will get access to Microsoft Excel, you will create a table with data that you will later be able to manipulate to calculate sums, average amounts, percentages, and also calculate with certain criteria. Your new skills will help you efficiently manipulate data and operate with formulas that can help make more efficient management decisions in the future.
Data Visualization Capstone
Data visualization is a critical skill for anyone that routinely using quantitative data in his or her work - which is to say that data visualization is a tool that almost every worker needs today. One of the critical tools for data visualization today is the R statistical programming language. Especially in conjunction with the tidyverse software packages, R has become an extremely powerful and flexible platform for making figures, tables, and reproducible reports. However, R can be intimidating for first time users, and there are so many resources online that it can be difficult to sort through without guidance. This is the final course in the Specialization "Data Visualization and Dashboarding in R." Learners in this course will enter with a well-developed set of skills making a wide variety of visualizations in R. The focus on this course will applying those skills to a unique project, drawing on publicly available data to tell a compelling story using the data visualization toolkit assembled in the previous courses.
Artificial Intelligence Algorithms Models and Limitations
We live in an age increasingly dominated by algorithms. As machine learning models begin making important decisions based on massive datasets, we need to be aware of their limitations in the real world. Whether it's making loan decisions or re-routing traffic, machine learning models need to accurately reflect our shared values. In this course, we will explore the rise of algorithms, from the most basic to the fully-autonomous, and discuss how to make them more ethically sound.