NLP: Foundations to Advanced

Explore the fascinating world of Natural Language Processing, from linguistic foundations to modern transformer models and real-world applications.

Mayank Sharma 4 articles
1

Text Processing: Tokenization, Stemming, Lemmatization

Feb 01, 2026

Master the foundational building blocks of NLP: how to break text into tokens and reduce them to their root forms using stemming and lemmatization.

2

Text Representation: From Bag-of-Words to TF-IDF

Feb 03, 2026

Learn how to transform raw text into numerical vectors that machines can understand using Bag-of-Words and TF-IDF techniques.

3

Word Embeddings from Scratch: CBOW and Skip-Gram

Feb 05, 2026

Understand how Word2Vec learns dense word vectors by implementing both CBOW and Skip-Gram from scratch in a single hands-on Jupyter notebook.

4

Advanced Word Embeddings from Scratch: GloVe and FastText

Feb 07, 2026

Learn how advanced embeddings go beyond Word2Vec by implementing a toy GloVe model and a FastText-style subword model from scratch in a Jupyter notebook.
