DelftX: AI Skills for Engineers: Data Engineering and Data Pipelines

Good data is central to effective AI applications. This course teaches the basics of data for AI, covering what data is needed, how to extract data from existing databases and basic data skills including setup of a Python notebook environment, basic data exploration and simple data visualizations.

6 semanas

5–7 horas por semana

A tu ritmo

Avanza a tu ritmo

Este curso está archivado

Ver el material del curso

Me gustaría recibir correos electrónicos de DelftX e informarme sobre otras ofertas relacionadas con AI Skills for Engineers: Data Engineering and Data Pipelines.

Sobre este curso

Omitir Sobre este curso

Artificial Intelligence and Machine Learning have become central techniques for most services and products, ranging from web-based systems to medical procedures, self-driving cars – even intelligent coffee makers.

Alongside algorithms, data is central to AI applications. Without solid data management, AI projects typically underperform or even fail. Unfortunately, the relevance and complexity of handling data is frequently underestimated.

That’s why we developed this course which covers foundational questions like “Why is data important to AI?” and “What data does AI need?” and covers more application-oriented topics and skills like how to extract, load and query data using an SQL pipeline.

In the second part of the course, you will learn basic data engineering skills, including how to setup your Python notebook environment, explore data with advanced pandas functions, and create simple and clear data visualizations.

This introductory course is targeted at learners with little experience in data management or Python-based data management who want to develop Python-based AI applications in the future. The course covers a brief introduction into data management for AI, relational data management (e.g., SQL), and practical data handling skills in Python, pandas, and Jupyter.

This allows you to build a foundation to prepare for future AI and Machine Learning development with Python.

De un vistazo

Institution DelftX
Subject Análisis de datos
Level Introductory
Prerequisites
- Basic Python programming skills
- A computer running Windows, MacOS, Linux is needed. Chromebooks and mobile operating systems are not supported.
Associated programs
- Professional Certificate en Data Skills for Artificial Intelligence

Language English
Video Transcripts اَلْعَرَبِيَّةُ, Deutsch, English, Español, Français, हिन्दी, Bahasa Indonesia, Português, Kiswahili, తెలుగు, Türkçe, 中文
Associated skillsAlgorithms, Pandas (Python Package), Artificial Intelligence, Relational Databases, Machine Learning, Applications Of Artificial Intelligence, SQL (Programming Language), Medical Practices And Procedures, Management, Data Management, Data Engineering, Python (Programming Language), Jupyter

Lo que aprenderás

Omitir Lo que aprenderás

Why Data Management is central to AI applications
What kind of data these applications need
How to obtain data for AI applications
How to extract and query data from existing databases using SQL
How to setup your Python notebooks
How to use the pandas library to work with tabular data
How to visualize data using the Seaborn library

Plan de estudios

Omitir Plan de estudios

Week 1:

We ask why we should care about data management for Artificial Intelligence and Machine Learning (ML) systems.
We examine which data are needed in the ML lifecycle and what properties that data should have.
We discuss the effort and time needed for data management activities, and look at possible data sources.

Week 2:

The basic key concepts of data management, such as databases, data models and data schemas are all introduced.
The Relational Data Model is explained and contrasted with the Single-Table Model (like CSV and Excel) and Document Models.

Week 3:

We show how to extract data from existing relational databases using SQL queries and converting the query results into CSV files for further processing using pandas in Python notebooks.

Week 4:

The different ways setoff setting up and running Python notebooks are covered, including cloud-based notebooks and local notebooks.
We will take you step by step through the process of setting up your conda environment and installing Jupyter and pandas libraries.
You will learn how to run notebooks in VS code.

Week 5:

Become a pandas expert.
Explore the essential functionalities of pandas and, most importantly, write elegant and efficient Python pandas code to process and engineer tabular data.

Week 6:

You will learn how to make simple and clear scientific figures in Python using the Seaborn library.
Use the core functions provided by Seaborn to make beautiful statistical plots.

¿Quién puede hacer este curso?

Lamentablemente, las personas residentes en uno o más de los siguientes países o regiones no podrán registrarse para este curso: Irán, Cuba y la región de Crimea en Ucrania. Si bien edX consiguió licencias de la Oficina de Control de Activos Extranjeros de los EE. UU. (U.S. Office of Foreign Assets Control, OFAC) para ofrecer nuestros cursos a personas en estos países y regiones, las licencias que hemos recibido no son lo suficientemente amplias como para permitirnos dictar este curso en todas las ubicaciones. edX lamenta profundamente que las sanciones estadounidenses impidan que ofrezcamos todos nuestros cursos a cualquier persona, sin importar dónde viva.

Este curso es parte del programa Data Skills for Artificial Intelligence Professional Certificate

Más información

Instrucción por expertos

2 cursos de capacitación

A tu ritmo

Avanza a tu ritmo

3 meses

5 - 6 horas semanales

Formas de realizar este curso

Elige tu camino al inscribirte.

	Verified Track	Audit Track
Costo	149 US$	Free
Acceso a los materiales del curso	Ilimitado	Limitado
World class institutions and universities
Asistencia de edX
Certificado para compartir al finalizar
Tareas con calificación y exámenes

Visita la sección de Preguntas frecuentes con preguntas frecuentes sobre estas modalidades.

¿Te interesa este curso para tu negocio o equipo?

Capacita a tus empleados en los temas más solicitados con edX para Negocios.

Comprar Solicitar información

DelftX: AI Skills for Engineers: Data Engineering and Data Pipelines

AI Skills for Engineers: Data Engineering and Data Pipelines

Sobre este curso

De un vistazo

Lo que aprenderás

Plan de estudios

¿Quién puede hacer este curso?

Este curso es parte del programa Data Skills for Artificial Intelligence Professional Certificate

Formas de realizar este curso

Verified Track

Audit Track

Costo

Acceso a los materiales del curso

World class institutions and universities

Asistencia de edX

Certificado para compartir al finalizar

Tareas con calificación y exámenes

¿Te interesa este curso para tu negocio o equipo?