My Blog

Practical insights about Python, React, machine learning, and AI.

Aug 2025

Document Summarization: Eval First

Build an extractive/abstractive summarization workflow with dataset splits, ROUGE/BERTScore, and human eval checklists.

Aug 14

11 min read

Subhajit Bhar

NLP Entity Matching with Fuzzy Search

A practical guide to entity matching using TF-IDF, Jaro-Winkler, and embedding similarity with thresholds and evaluation.

Aug 14

10 min read

Subhajit Bhar

NLP Text Cleaning Pipeline

A practical, production-ready text cleaning pipeline for NLP tasks with regex rules, normalization, and evaluation hooks.

Aug 14

9 min read

Subhajit Bhar

LangChain: RAG Architecture & Code

RAG is a design pattern, not a product. LangChain supports it out of the box. This guide shows a production-ready RAG setup in LangChain with architecture, retrieval choices, runnable code, evaluation metrics, and trade-offs from my client projects.

Aug 2

12 min read

Subhajit Bhar

Jul 2025

LightRAG: Lean RAG with Benchmarks

LightRAG is a minimal RAG toolkit that strips away heavy abstractions. Here’s a complete build with code, performance numbers versus a LangChain baseline, and when LightRAG is the right choice.

Jul 30

12 min read

Subhajit Bhar

RAG for Knowledge-Intensive Tasks

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Jul 24

5 min read

Subhajit Bhar

On-Page vs Off-Page SEO

Learn the fundamental differences between on-page and off-page SEO strategies and how they work together to boost your search rankings

Jul 12

5 min read

Subhajit Bhar

Aug 2024

Pandas: Handle Missing Values

Complete guide to detecting, analyzing, and handling missing values in pandas DataFrames using fillna, interpolation, and advanced imputation techniques.

Aug 15

8 min read

Subhajit Bhar

Jul 2024

Outliers in Python: IQR and Z-Score

Complete guide to outlier detection and removal using IQR, Z-Score, and isolation forest methods with practical Python examples.

Jul 25

9 min read

Subhajit Bhar

Jun 2024

NumPy: reshape() vs flatten()

Learn when to use reshape() vs flatten() in NumPy for array manipulation, with practical examples and performance comparisons.

Jun 30

7 min read

Subhajit Bhar

Ready to Learn More?

Get practical tips about Python, React, machine learning, and AI delivered to your inbox.