Back to Courses

Data Analysis Courses - Page 38

Showing results 371-380 of 998
Basic Descriptives using R Cmdr
In this 1-hour long project-based course, we will show you how to do basic descriptives using RCmdr .You will learn about measures of central tendency and dispersion. This project uses data about cereals that you eat and details about their sugar, fiber calorie content. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Python Packages for Data Science
How many times have you decided to learn a programming language but got stuck somewhere along the way, grew frustrated, and gave up? This specialization is designed for learners who have little or no programming experience but want to use Python as a tool to play with data. Now that you have mastered the fundamentals of Python and Python functions, you will turn your attention to Python packages specifically used for Data Science, such as Pandas, Numpy, Matplotlib, and Seaborn. Are you ready? Let's go! Logo image courtesy of Mourizal Zativa. Available on Unsplash here: https://unsplash.com/photos/gNMVpAPe3PE
Foundations for Big Data Analysis with SQL
In this course, you'll get a big-picture view of using SQL for big data, starting with an overview of data, database systems, and the common querying language (SQL). Then you'll learn the characteristics of big data and SQL tools for working on big data platforms. You'll also install an exercise environment (virtual machine) to be used through the specialization courses, and you'll have an opportunity to do some initial exploration of databases and tables in that environment. By the end of the course, you will be able to • distinguish operational from analytic databases, and understand how these are applied in big data; • understand how database and table design provides structures for working with data; • appreciate how differences in volume and variety of data affects your choice of an appropriate database system; • recognize the features and benefits of SQL dialects designed to work with big data systems for storage and analysis; and • explore databases and tables in a big data platform. To use the hands-on environment for this course, you need to download and install a virtual machine and the software on which to run it. Before continuing, be sure that you have access to a computer that meets the following hardware and software requirements: • Windows, macOS, or Linux operating system (iPads and Android tablets will not work) • 64-bit operating system (32-bit operating systems will not work) • 8 GB RAM or more • 25GB free disk space or more • Intel VT-x or AMD-V virtualization support enabled (on Mac computers with Intel processors, this is always enabled; on Windows and Linux computers, you might need to enable it in the BIOS) • For Windows XP computers only: You must have an unzip utility such as 7-Zip or WinZip installed (Windows XP’s built-in unzip utility will not work)
Prepare Data for Exploration
This is the third course in the Google Data Analytics Certificate. These courses will equip you with the skills needed to apply to introductory-level data analyst jobs. As you continue to build on your understanding of the topics from the first two courses, you’ll also be introduced to new topics that will help you gain practical data analytics skills. You’ll learn how to use tools like spreadsheets and SQL to extract and make use of the right data for your objectives and how to organize and protect your data. Current Google data analysts will continue to instruct and provide you with hands-on ways to accomplish common data analyst tasks with the best tools and resources. Learners who complete this certificate program will be equipped to apply for introductory-level jobs as data analysts. No previous experience is necessary. By the end of this course, you will: - Find out how analysts decide which data to collect for analysis. - Learn about structured and unstructured data, data types, and data formats. - Discover how to identify different types of bias in data to help ensure data credibility. - Explore how analysts use spreadsheets and SQL with databases and data sets. - Examine open data and the relationship between and importance of data ethics and data privacy. - Gain an understanding of how to access databases and extract, filter, and sort the data they contain. - Learn the best practices for organizing data and keeping it secure.
Data Analysis in R with RStudio & Tidyverse
Code and run your first R program in minutes without installing anything! This course is designed for learners with no prior coding experience, providing foundational knowledge of data analysis in R. The modules in this course cover descriptive statistics, importing and wrangling data, and using statistical tests to compare populations and describe relationships. This course presents examples in R using the industry-standard Integrated Development Environment (IDE) RStudio. To allow for a truly hands-on, self-paced learning experience, this course is video-free. Assignments contain short explanations with images and runnable code examples with suggested edits to explore code examples further, building a deeper understanding by doing. You’ll benefit from instant feedback from a variety of assessment items along the way, gently progressing from quick understanding checks (multiple choice, fill in the blank, and un-scrambling code blocks) to small, approachable coding exercises that take minutes instead of hours. Finally, a cumulative lab at the end of the course will provide you an opportunity to apply all learned concepts within a real-world context.
Data Science at Scale - Capstone Project
In the capstone, students will engage on a real world project requiring them to apply skills from the entire data science pipeline: preparing, organizing, and transforming data, constructing a model, and evaluating results. Through a collaboration with Coursolve, each Capstone project is associated with partner stakeholders who have a vested interest in your results and are eager to deploy them in practice. These projects will not be straightforward and the outcome is not prescribed -- you will need to tolerate ambiguity and negative results! But we believe the experience will be rewarding and will better prepare you for data science projects in practice.
Breast Cancer Prediction Using Machine Learning
In this 2 hours long project-based course, you will learn to build a Logistic regression model using Scikit-learn to classify breast cancer as either Malignant or Benign. We will use the Breast Cancer Wisconsin (Diagnostic) Data Set from Kaggle. Our goal is to use a simple logistic regression classifier for cancer classification. We will be carrying out the entire project on the Google Colab environment. You will need a free Gmail account to complete this project. Please be aware of the fact that the dataset and the model in this project, can not be used in real-life. We are only using this data for educational purposes. By the end of this project, you will be able to build the logistic regression classifier to classify between cancerous and noncancerous patients. You will also be able to set up and work with the Google colab environment. Additionally, you will also be able to clean and prepare data for analysis. You should be familiar with the Python Programming language and you should have a theoretical understanding of the Logistic Regression algorithm. You will need a free Gmail account to complete this project. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Validate Data in SQL using MySQL Workbench
By the end of this project, you will validate MySQL data in a MySQL database using SQL Triggers in MySQL Workbench. MySQL is a widely used relational database. Often data is validated by applications before being inserted into a database. It is a good idea to validate data at the database level, since applications may use inconsistent validation leaving the data at risk. MySQL workbench provides a User Interface to MySQL that allows the creation of triggers to perform validation before an insertion or update is performed.
Analyze Stock Data using R and Quantmod Package
In this 1-hour long project-based course, you will learn how to pull down Stock Data using the R quantmod package. You will also learn how to perform analytics and pass financial risk functions to the data. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Advanced Data Visualization with R
Data visualization is a critical skill for anyone that routinely using quantitative data in his or her work - which is to say that data visualization is a tool that almost every worker needs today. One of the critical tools for data visualization today is the R statistical programming language. Especially in conjunction with the tidyverse software packages, R has become an extremely powerful and flexible platform for making figures, tables, and reproducible reports. However, R can be intimidating for first time users, and there are so many resources online that it can be difficult to sort through without guidance. This course is the third in the Specialization "Data Visualization and Dashboarding in R." Learners come into this course with a foundation using R to make many basic kinds of visualization, primarily with the ggplot2 package. Accordingly, this course focuses on expanding the learners' inventory of data visualization options. Drawing on additional packages to supplement ggplot2, learners will made more variants of traditional figures, as well as venture into spatial data. The course ends make interactive and animated figures. To fill that need, this course is intended for learners who have little or no experience with R but who are looking for an introduction to this tool. By the end of this course, students will be able to import data into R, manipulate that data using tools from the popular tidyverse package, and make simple reports using R Markdown. The course is designed for students with good basic computing skills, but limited if any experience with programming.