Lecture 0: Introduction

Author CS

Date March

Learning with Unstructured Data

Prof. Claudia Soares
claudia.soares@fct.unl.pt

Today

Course outline
Introduction to deep learning
Fundamentals of machine learning

Outline

Lecture 1.1: Introduction
Lecture 1.2: Fundamentals of machine learning
Lecture 2.1: Multi-layer perceptron
Lecture 2.2: Training neural networks
Lecture 3.1: Computer Vision
Lecture 3.2: Convolutional neural networks
Lecture 4: Attention and transformer networks
Lecture 5: Graph Neural Networks
Lecture 6: Natural Language Processing

Main Reference

Good balance of theory and practice
Covers almost all topics; Will be added more references, as adequate
Online book dl2.ai

My mission

By the end of this course, you will have acquired a solid and detailed understanding of the field of deep learning.

You will have learned how to design deep neural networks for a wide range of advanced probabilistic inference tasks and how to train them.

These models seen in the course apply to a wide variety of artificial intelligence problems, with plenty of applications in engineering and science.

Why learning?

???

Sheepdog or mop? ]

Chihuahua or muffin? ]

The (human) brain is so good at interpreting visual information that the gap between raw data and its semantic interpretation is difficult to assess intuitively:

This is a mushroom. ]

This is a mushroom.

This is a mushroom.

This is a mushroom.

Writing a computer program that sees?

.center.width-60[]

.center.width-60[]

.center.width-80[]

.center.width-80[]

Extracting semantic information requires models of high complexity, which cannot be designed by hand.

However, one can write a program that learns the task of extracting semantic information.

.center.width-70[]

The common approach used in practice consists of: - defining a parametric model with high capacity, - optimizing its parameters, by “making it work” on the training data.

Applications and successes

Object detection, pose estimation, segmentation (2019)

Reinforcement learning (Mnih et al, 2014)

Strategy games (Deepmind, 2016-2018)

Autonomous cars (NVIDIA, 2016)

Autonomous cars (Waymo, 2022)

Physics simulation (Sanchez-Gonzalez et al, 2020)

]

AI for Science (Deepmind, AlphaFold, 2020)

Speech synthesis and question answering (Google, 2018)

Image generation and AI art (OpenAI, 2022)

Write computer code (OpenAI, 2021)

.center.width-100[]

Answer all your questions (OpenAI, 2022)

Dali Lives (2019)

.italic[“ACM named .bold[Yoshua Bengio], .bold[Geoffrey Hinton], and .bold[Yann LeCun] recipients of the .bold[2018 ACM A.M. Turing Award] for conceptual and engineering breakthroughs that have made deep neural networks a critical component of computing.”]

Why does it work now?

.center.grid[ .kol-1-2[ Algorithms (old and new)

.width-90[]] .center.kol-1-2[ More data

.width-50[]]]

.center.grid[ .kol-1-2[ Software
.width-90[]] .kol-1-2[ Faster compute engines

.width-50[]]]

???

The success of deep learning is multi-factorial…

Building on the shoulders of giants

Five decades of research in machine learning provided - a taxonomy of ML concepts (classification, generative models, clustering, kernels, linear embeddings, etc.), - a sound statistical formalization (Bayesian estimation, PAC), - a clear picture of fundamental issues (bias/variance dilemma, VC dimension, generalization bounds, etc.), - a good understanding of optimization issues, - efficient large-scale algorithms.

Deep learning

From a practical perspective, deep learning - lessens the need for a deep mathematical grasp, - makes the design of large learning architectures a system/software development task, - allows to leverage modern hardware (clusters of GPUs), - does not plateau when using more data, - makes large trained networks a commodity.

.center.circle.width-30[]

.italic[For the last forty years we have programmed computers; for the next forty years we will train them.]

The end.