Back to Courses

Data Science Courses - Page 65

Showing results 641-650 of 1407
Working with Big Data
By the end of this project, you will set up an environment for Big Data Development using Visual Studio Code, MongoDB and Apache Spark. You will then use the environment to process a large dataset from NOAA showing hourly precipitation rates for a ten year period from the state of Wisconsin. MongoDB is a widely used NoSQL database well suited for very large datasets or Big Data. It is highly scalable and adaptable as well. Apache Spark is used for efficient in-memory processing of Big Data.
Capstone Project: Advanced AI for Drug Discovery
In this capstone project course, we'll compare genome sequences of COVID-19 mutations to identify potential areas a drug therapy can look to target. The first step in drug discovery involves identifying target subsequences of theirs genome to target. We'll start by comparing the genomes of virus mutations to look for similarities. Then, we'll perform PCA to cut down our number of dimensions and identify the most common features. Next, we'll use K-means clustering in Python to find the optimal number of groups and trace the lineage of the virus. Finally, we'll predict similarity between the sequences and use this to pick a target subsequence. Throughout the course, each section will consist of a programming assignment coupled with a guide video and helpful hints. By the end, you'll be well on your way to discovering ways to combat disease with genome sequencing.
Designing Autonomous AI
When children learn how to hit a baseball, they don’t start with fastballs. Their coaches begin with the basics: how to grip the handle of the bat, where to put their feet and how to keep their eyes on the ball. Similarly, an autonomous AI system needs a subject matter expert (SME) to break a complex process or problem into easier tasks that give the AI important clues about how to find a solution faster. In this course, you’ll learn how to distill a business challenge into its component parts by creating an autonomous AI design plan. Using lessons, goal setting, skills, strategies and rewards, you’ll incorporate your SME’s knowledge directly into your AI’s “brain,” the agent that powers your autonomous system. You'll learn when and how to combine various AI architecture design patterns, as well as how to design an advanced AI at the architectural level without worrying about the implementation of neural networks or machine learning algorithms. At the end of this course, you’ll be able to: • Interview SMEs to extract their unique knowledge about a system or process • Combine reinforcement learning with expert rules, optimization and mathematical calculations in an AI brain • Design an autonomous AI brain from modular components to guide the learning process for a particular task •. Validate your brain design against existing expertise and techniques for solving problems • Produce a detailed specifications document so that someone else can build your AI brain This course is part of a specialization called Autonomous AI for Industry, which will launch in fall 2022.
Create Geovisualizations in Tableau
Tableau is widely recognized as one of the premier data visualization software programs. For many years access to the program was limited to those who purchased licenses. Recently, Tableau launched a public version that grants the ability to create amazing data visualizations for free. Account members can also share and join projects to collaborate on projects that can change the world. In this project, we will learn how to create an account, how to build simple bar graphs, and how to create a geovisualization. Learning to use this in-demand tool has applications in Marketing, Finance, Operations, Sales, and many other business functions. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Data Studio: Qwik Start
This is a self-paced lab that takes place in the Google Cloud console. Data Studio lets you create dynamic, visually compelling reports and dashboards. Watch the short video <A HREF="https://youtu.be/eSY6daAlk5k">Data Studio: Qwik Start - Qwiklabs Preview</A>.
Using Tensorflow for Image Style Transfer
Have you ever wished you could paint like Van Gogh, Monet or even Picasso? Better yet, have you wished for an easy way to convert your own images into new ones incorporating the style of these famous artists? With Neural Style Transfer, Convolutional Neural Networks (CNNs) distill the essence of the style of any famous artist it is fed, and are able to transfer that style to any other image. In this project-based course, you will learn how to utilize Python and Tensorflow to build a Neural Style Transfer (NST) model using a VGG19 CNN. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
AI Workflow: Feature Engineering and Bias Detection
This is the third course in the IBM AI Enterprise Workflow Certification specialization.    You are STRONGLY encouraged to complete these courses in order as they are not individual independent courses, but part of a workflow where each course builds on the previous ones.   Course 3 introduces you to the next stage of the workflow for our hypothetical media company.  In this stage of work you will learn best practices for feature engineering, handling class imbalances and detecting bias in the data.  Class imbalances can seriously affect the validity of your machine learning models, and the mitigation of bias in data is essential to reducing the risk associated with biased models.  These topics will be followed by sections on best practices for dimension reduction, outlier detection, and unsupervised learning techniques for finding patterns in your data.  The case studies will focus on topic modeling and data visualization.   By the end of this course you will be able to: 1.  Employ the tools that help address class and class imbalance issues 2.  Explain the ethical considerations regarding bias in data 3.  Employ ai Fairness 360 open source libraries to detect bias in models 4.  Employ dimension reduction techniques for both EDA and transformations stages 5.  Describe topic modeling techniques in natural language processing 6.  Use topic modeling and visualization to explore text data 7.  Employ outlier handling best practices in high dimension data 8.  Employ outlier detection algorithms as a quality assurance tool and a modeling tool 9.  Employ unsupervised learning techniques using pipelines as part of the AI workflow 10.  Employ basic clustering algorithms   Who should take this course? This course targets existing data science practitioners that have expertise building machine learning models, who want to deepen their skills on building and deploying AI in large enterprises. If you are an aspiring Data Scientist, this course is NOT for you as you need real world expertise to benefit from the content of these courses.   What skills should you have? It is assumed that you have completed Courses 1 and 2 of the IBM AI Enterprise Workflow specialization and you have a solid understanding of the following topics prior to starting this course: Fundamental understanding of Linear Algebra; Understand sampling, probability theory, and probability distributions; Knowledge of descriptive and inferential statistical concepts; General understanding of machine learning techniques and best practices; Practiced understanding of Python and the packages commonly used in data science: NumPy, Pandas, matplotlib, scikit-learn; Familiarity with IBM Watson Studio; Familiarity with the design thinking process.
Join Tables in an SQL Query in LibreOffice Base
By the end of this project, you will have used LibreOffice Base to write SQL queries that join related tables together to retrieve and display data from multiple tables. Since a well-designed database stores each data topic—such as “student” and “course”—in a separate table, an SQL query is often required to join tables together to get all of the data required. You will learn to do that with the WHERE clause and with the JOIN command in SQL. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Investment Risk Management
By the end of the project, you will learn how to quantify risk-to-reward using Treynor Ratio, and calculate the value at risk for investment portfolio. ATTENTION: To take this course, it is required that you are familiar basic financial risk management concepts. You can gain them by taking the guided project Compare Stock Returns with Google Sheets. Note: This course works best for learners who are based in the North America region. We're currently working on providing the same experience in other regions. This course's content is not intended to be investment advice and does not constitute an offer to perform any operations in the regulated or unregulated financial market
Deep Learning with PyTorch : Build an AutoEncoder
In these one hour project-based course, you will learn to implement autoencoder using PyTorch. An autoencoder is a type of neural network that learns to copy its input to its output. In autoencoder, encoder encodes the image into compressed representation, and the decoder decodes the representation to reconstruct the image. We will use autoencoder for denoising hand written digits using a deep learning framework like pytorch. This guided project is for learners who want to use pytorch for building deep learning models.Learners who want to apply autoencoder practically using PyTorch. In order to be successful in this project, you should be familiar with python , basic pytorch like creating or defining neural network and convolutional neural network.