Back to Courses

Data Analysis Courses - Page 95

Showing results 941-950 of 998
Plots Creation using Matplotlib Python
By the end of this project, you will be able to add the data in the CSV file to Pandas data frame, plot the graph, and set marker type and color. You will also be able to apply labels, change font size, add grid lines and legends. Finally, you will be able to create the boxplot and save the graph as an image using the matplotlib and seaborn libraries, which are the most important libraries in python that are used for Data Visualization. You can create bar-plots, scatter-plots, histograms, and a lot more with them. This guided project is for people in the field of data and data analysis. people who want to learn python and Pandas library. It provides you with the important steps to be a data analyst. Moreover, it equips you with the knowledge of python's native data structures
Optimizing Performance of LookML Queries
This is a Google Cloud Self-Paced Lab. In this lab, you'll learn the best methods to optimize query performance in Looker. Looker is a modern data platform in Google Cloud that you can use to analyze and visualize your data interactively. You can use Looker to do in-depth data analysis, integrate insights across different data sources, build actionable data-driven workflows, and create custom data applications. Big, complex queries can be costly, and running them repeatedly strains your database, thereby reducing performance. Ideally, you want to avoid re-running massive queries if nothing has changed, and instead, append new data to existing results to reduce repetitive requests. Although there are many ways to optimize performance of LookML queries, this lab focuses on the most commonly used methods to optimize query performance in Looker: persistent derived tables, aggregate awareness, and performantly joining views.
Custom & Conditional Formatting in Microsoft Excel
At the end of this project, you will be able to customize your formatting in Microsoft Excel. You will also be able to use basic conditional formatting to ensure a seamless use of Microsoft Excel and to highlight important sections of your sheet. Being able to use custom and conditional formatting in Microsoft Excel will allow you to make sense of the data quickly. This allows you to visualize your data and make it easier for you and for others to understand the data. This becomes more important the more data is contained in the sheet. The goal of this project is to allow you to visualize your data for yourself and others using Microsoft Excel.
Data Catalog: Qwik Start
This is a self-paced lab that takes place in the Google Cloud console. In this lab you will explore existing datasets with Data Catalog and mine the table and column metadata for insights.
Azure Data Lake Storage Gen2 and Data Streaming Solution
In this course, you will see how Azure Data Lake Storage can make processing Big Data analytical solutions more efficient and how easy it is to set up. You will also explore how it fits into common architectures, as well as the different methods of uploading the data to the data store. You will examine the myriad of security features that will ensure your data is secure. Learn the concepts of event processing and streaming data and how this applies to Azure Stream Analytics. You will then set up a stream analytics job to stream data, and learn how to manage and monitor a running job. This course is part of a Specialization intended for Data engineers and developers who want to demonstrate their expertise in designing and implementing data solutions that use Microsoft Azure data services for anyone interested in preparing for the Exam DP-203: Data Engineering on Microsoft Azure (beta). You will take a practice exam that covers key skills measured by the certification exam. This is the ninth course in a program of 10 courses to help prepare you to take the exam so that you can have expertise in designing and implementing data solutions that use Microsoft Azure data services. The Data Engineering on Microsoft Azure exam is an opportunity to prove knowledge expertise in integrating, transforming, and consolidating data from various structured and unstructured data systems into structures that are suitable for building analytics solutions that use Microsoft Azure data services. Each course teaches you the concepts and skills that are measured by the exam. By the end of this Specialization, you will be ready to take and sign-up for the Exam DP-203: Data Engineering on Microsoft Azure (beta).
RPA Lifecycle: Introduction, Discovery and Design
Regardless of the industry, business process automation is here to stay. Business processes are increasingly being automated through software automation. But how is software automation simplified and made accessible? The answer is Robotic Process Automation or RPA. RPA lifecycle: Intrdocution, Discovery and Design is the first course of the Specialization on Implementing RPA with Cognitive Solutions and Analytics. In this course, you will be introduced to RPA and see how it is combined with cognitive solutions and analytics to enable organizations to adopt the model of a “Digital Workforce” and help them transform the nature of their work. You will also learn about the first two phases of the RPA implementation lifecycle: Discovery and Design. In the Discovery phase, you take a set of business processes and identify those that are suitable and beneficial for automation. In the Design phase, you create a plan for your RPA project. The learning will be reinforced through concept description and guided practice.
Analyse datasets with Java streams
In this 1-hour long project-based course, you will learn how to create a Java Stream object based on a CSV data file, and engineer its data using Stream and Collector methods. You will explore the dataset using stream methods, and then apply a reduction operations on the data using a range of Collector methods. You will learn how to join and split strings in the data. You will apply the groupingBy method for grouping your data stream based on fields in your data object. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Ensemble Methods in Machine Learning
In this 2-hour long project-based course, you will learn how to implement various ensemble techniques and use it in machine learning. Ensemble models in machine learning combine the decisions from multiple models to improve the overall performance, The main causes of error in learning models are due to noise, bias and variance, Ensemble methods help to minimize these factors.
Python and Pandas for Data Engineering
In this first course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will learn how to set up a version-controlled Python working environment which can utilize third party libraries. You will learn to use Python and the powerful Pandas library for data analysis and manipulation. Additionally, you will also be introduced to Vim and Visual Studio Code, two popular tools for writing software. This course is valuable for beginning and intermediate students in order to begin transforming and manipulating data as a data engineer.
Machine Learning for Data Analysis
Are you interested in predicting future outcomes using your data? This course helps you do just that! Machine learning is the process of developing, testing, and applying predictive algorithms to achieve this goal. Make sure to familiarize yourself with course 3 of this specialization before diving into these machine learning concepts. Building on Course 3, which introduces students to integral supervised machine learning concepts, this course will provide an overview of many additional concepts, techniques, and algorithms in machine learning, from basic classification to decision trees and clustering. By completing this course, you will learn how to apply, test, and interpret machine learning algorithms as alternative methods for addressing your research questions.