Understand the BERT Transformer inside and out.
Follow me on Medium: towardsdatascience.com/likelihood-probability-and-…
Please subscribe to keep me alive: youtube.com/c/CodeEmporium?sub_confirmation=1
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: • Reinforcement Learning 101
⭕ Natural Language Processing: • Natural Language Processing 101
⭕ Transformers from Scratch: • Natural Language Processing 101
⭕ ChatGPT Playlist: • ChatGPT
⭕ Convolutional Neural Networks: • Convolution Neural Networks
⭕ The Math You Should Know: • The Math You Should Know
⭕ Probability Theory for Machine Learning: • Probability Theory for Machine Learning
⭕ Coding Machine Learning: • Code Machine Learning
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: imp.i384100.net/MathML
📕 Calculus: imp.i384100.net/Calculus
📕 Statistics for Data Science: imp.i384100.net/AdvancedStatistics
📕 Bayesian Statistics: imp.i384100.net/BayesianStatistics
📕 Linear Algebra: imp.i384100.net/LinearAlgebra
📕 Probability: imp.i384100.net/Probability
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: imp.i384100.net/Deep-Learning
📕 Python for Everybody: imp.i384100.net/python
📕 MLOps Course: imp.i384100.net/MLOps
📕 Natural Language Processing (NLP): imp.i384100.net/NLP
📕 Machine Learning in Production: imp.i384100.net/MLProduction
📕 Data Science Specialization: imp.i384100.net/DataScience
📕 Tensorflow: imp.i384100.net/Tensorflow
REFERENCES
[1] BERT main paper: arxiv.org/pdf/1810.04805.pdf
[2] BERT in Google Search: blog.google/products/search/search-language-unders…
[3] Overview of BERT: arxiv.org/pdf/2002.12327v1.pdf
[4] BERT word embeddings explained: medium.com/@_init_/why-bert-has-3-embedding-layers…
[5] More details of BERT in this amazing blog: towardsdatascience.com/bert-explained-state-of-the…
[6] Stanford lecture slides on BERT: nlp.stanford.edu/seminar/details/jdevlin.pdf