Overview of Artificial Neural Networks
Artificial Neural Networks (ANNs) are the backbone of modern Generative AI. They are designed to recognize patterns, learn representations, and generate data that mimics real-world distributions. This chapter gives you a concise overview of ANNs, emphasizing their significance in Generative AI.
Structure of Neural Networks
Neurons and Layers
A neural network consists of interconnected units called neurons, which are organized into layers:
- Input Layer: receives raw data (e.g., images, text, numerical inputs);
- Hidden Layers: process and transform data using weighted connections;
- Output Layer: produces predictions or classifications.
Each neuron applies a weighted sum to its inputs and passes the result through an activation function:

$$z = \sum_{i=1}^{n} w_i x_i + b$$

where:
- $x_i$ are input values;
- $w_i$ are weights;
- $b$ is the bias term;
- $z$ is the weighted sum passed to the activation function.
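As a quick illustration (not from the course materials; the input values, weights, and bias below are hypothetical), this weighted sum takes a few lines of NumPy:

```python
import numpy as np

# Hypothetical values for a single neuron with three inputs
inputs = np.array([0.5, -1.2, 3.0])    # x_i
weights = np.array([0.8, 0.1, -0.4])   # w_i
bias = 0.2                             # b

# Weighted sum: z = sum_i(w_i * x_i) + b
z = np.dot(weights, inputs) + bias
print(z)  # roughly -0.72, before any activation function is applied
```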
Activation Functions
Activation functions introduce non-linearity, enabling networks to learn complex patterns. Common activation functions include:
- Sigmoid, used for probabilities:
  $$\sigma(z) = \frac{1}{1 + e^{-z}}$$
- ReLU (Rectified Linear Unit), commonly used in deep networks:
  $$f(z) = \max(0, z)$$
- Tanh, useful for zero-centered outputs:
  $$\tanh(z) = \frac{e^{z} - e^{-z}}{e^{z} + e^{-z}}$$
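A minimal NumPy sketch of all three functions; the printed values are approximate:

```python
import numpy as np

def sigmoid(z):
    """Squashes z into (0, 1); often read as a probability."""
    return 1.0 / (1.0 + np.exp(-z))

def relu(z):
    """Zero for negative inputs, identity for positive ones."""
    return np.maximum(0.0, z)

def tanh(z):
    """Zero-centered output in (-1, 1)."""
    return np.tanh(z)

z = np.array([-2.0, 0.0, 2.0])
print(sigmoid(z))  # [0.119 0.5   0.881] (approximately)
print(relu(z))     # [0. 0. 2.]
print(tanh(z))     # [-0.964  0.     0.964] (approximately)
```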
Forward and Backward Propagation
Forward Propagation
Forward propagation refers to passing inputs through the network to compute the output. Each neuron computes:

$$a = f(z) = f\left(\sum_{i=1}^{n} w_i x_i + b\right)$$

where $f(z)$ is the activation function.
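The sketch below runs one forward pass through a tiny, randomly initialized network; the 2-3-1 layer sizes and the sigmoid activation are illustrative choices, not prescribed by the chapter:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical network: 2 inputs -> 3 hidden neurons -> 1 output
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(3, 2)), np.zeros(3)  # hidden layer parameters
W2, b2 = rng.normal(size=(1, 3)), np.zeros(1)  # output layer parameters

x = np.array([0.5, -1.0])      # input vector
h = sigmoid(W1 @ x + b1)       # hidden activations: f(z) at each neuron
y_hat = sigmoid(W2 @ h + b2)   # network output
print(y_hat)
```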
Backpropagation and Gradient Descent
To improve predictions, ANNs adjust weights using backpropagation, which minimizes error using gradient descent. The weight update rule in gradient descent is:

$$w_i \leftarrow w_i - \eta \frac{\partial L}{\partial w_i}$$

where:
- $\eta$ is the learning rate;
- $L$ is the loss function;
- $\frac{\partial L}{\partial w_i}$ is the gradient of the loss with respect to $w_i$.
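To make the update rule concrete, here is a minimal sketch applying it to the one-dimensional toy loss $L(w) = (w - 3)^2$, whose gradient is $2(w - 3)$; the learning rate and iteration count are arbitrary choices:

```python
def gradient_descent_step(w, grad, lr=0.1):
    """One update: w <- w - eta * dL/dw."""
    return w - lr * grad

# Toy example: minimize L(w) = (w - 3)^2, so dL/dw = 2 * (w - 3)
w = 0.0
for _ in range(50):
    grad = 2.0 * (w - 3.0)
    w = gradient_descent_step(w, grad, lr=0.1)
print(w)  # close to 3.0, the minimum of the loss
```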
Loss Functions and the Training Process
Loss Functions
Loss functions measure the difference between predicted and actual values. Common loss functions include:
- Mean Squared Error (MSE) (for regression):
  $$\text{MSE} = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2$$
- Cross-Entropy Loss (for classification):
  $$L = -\sum_{i=1}^{n} y_i \log(\hat{y}_i)$$

where:
- $y_i$ is the true label;
- $\hat{y}_i$ is the predicted probability.
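A minimal NumPy sketch of both losses; cross-entropy is shown in its binary form, and the small `eps` clamp is a common numerical safeguard rather than part of the formula:

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean Squared Error for regression."""
    return np.mean((y_true - y_pred) ** 2)

def cross_entropy(y_true, y_pred, eps=1e-12):
    """Binary cross-entropy; eps guards against log(0)."""
    y_pred = np.clip(y_pred, eps, 1.0 - eps)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

y_true = np.array([1.0, 0.0, 1.0])
y_pred = np.array([0.9, 0.2, 0.7])
print(mse(y_true, y_pred))           # 0.0467 (approximately)
print(cross_entropy(y_true, y_pred)) # 0.2284 (approximately)
```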
Training Process
1. Initialize weights randomly.
2. Perform forward propagation to compute predictions.
3. Compute the loss using the chosen loss function.
4. Use backpropagation to compute weight updates.
5. Update weights using gradient descent.
6. Repeat for multiple epochs until the network converges (see the end-to-end sketch after this list).
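The following sketch ties the six steps together by training a tiny 2-4-1 network on XOR with sigmoid activations and MSE loss; the seed, hidden width, learning rate, and epoch count are illustrative and not taken from the course:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(42)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

# Step 1: initialize weights randomly
W1, b1 = rng.normal(size=(2, 4)), np.zeros((1, 4))
W2, b2 = rng.normal(size=(4, 1)), np.zeros((1, 1))
eta = 2.0  # learning rate

for epoch in range(10_000):
    # Step 2: forward propagation
    h = sigmoid(X @ W1 + b1)       # hidden activations
    y_hat = sigmoid(h @ W2 + b2)   # predictions

    # Step 3: compute the loss (MSE), tracked here for monitoring
    loss = np.mean((y - y_hat) ** 2)

    # Step 4: backpropagation via the chain rule
    # (constant factors are folded into the learning rate)
    d_out = (y_hat - y) * y_hat * (1 - y_hat)   # error signal at the output
    d_hid = (d_out @ W2.T) * h * (1 - h)        # error signal at the hidden layer

    # Step 5: gradient descent weight updates
    W2 -= eta * h.T @ d_out / len(X)
    b2 -= eta * d_out.mean(axis=0, keepdims=True)
    W1 -= eta * X.T @ d_hid / len(X)
    b1 -= eta * d_hid.mean(axis=0, keepdims=True)

# Step 6 happened via the epoch loop; predictions typically approach [0, 1, 1, 0]
print(f"final loss: {loss:.4f}, predictions: {np.round(y_hat.ravel(), 2)}")
```

Because MSE with sigmoid outputs can occasionally stall on XOR, a different seed or learning rate may be needed for convergence.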
The Universal Approximation Theorem and Deep Learning
Universal Approximation Theorem
The Universal Approximation Theorem states that a neural network with at least one hidden layer can approximate any continuous function to arbitrary accuracy, given enough neurons and suitable weights. This explains why ANNs can model highly complex relationships.
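In its classic single-hidden-layer form, the approximator the theorem refers to can be written as below; this LaTeX snippet restates the statement rather than proving it:

```latex
% A single hidden layer of N neurons with activation \sigma, combined
% linearly, can approximate a continuous f on a compact set K to any
% accuracy \varepsilon, for a sufficiently large N.
\[
  \hat{f}(x) = \sum_{i=1}^{N} c_i \, \sigma\!\left(w_i^{\top} x + b_i\right),
  \qquad
  \sup_{x \in K} \bigl| f(x) - \hat{f}(x) \bigr| < \varepsilon .
\]
```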
Deep Learning and Its Significance
Deep Learning extends ANNs by adding many hidden layers, allowing them to:
- Extract hierarchical features (useful in image processing and NLP);
- Model complex probability distributions (critical for Generative AI);
- Learn without manual feature engineering (as seen in self-supervised learning).
Conclusion
This chapter introduced the core principles of ANNs, emphasizing their structure, learning process, and significance in deep learning. These concepts lay the foundation for advanced Generative AI techniques like GANs and VAEs, which rely on neural networks to generate realistic data.
Review Questions
1. Which of the following is NOT a component of an artificial neural network?
2. What is the primary purpose of backpropagation in neural networks?
3. The Universal Approximation Theorem states that a sufficiently large neural network can approximate which of the following?