In this blog, we'll explore a complete Python solution that detects and extracts tables and text from images using libraries like Transformers, OpenCV, PaddleOCR, and easyOCR. This step-by-step breakdown includes code to detect tables, extract content from individual table cells, and retrieve any remaining text in the image.

Overview

When working with scanned documents, such as invoices or forms, it is essential to accurately extract both structured information (like tables) and unstructured text. The approach we'll explore uses Microsoft's pretrained object-detection model to locate tables, and OCR techniques to extract the text from both the table cells and the rest of the image.

Steps:

1. Detect the table using Microsoft's model, and save a cropped image that contains only the detected table (see the first sketch below).
2. From the detected table, create a separate image for each cell.
3. Read the text from each cell image.
4. Finally, read the remaining text in the image outside the detected table region (see the second sketch below).
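Here is a minimal sketch of step 1, the detection-and-crop stage. It assumes the `microsoft/table-transformer-detection` checkpoint from the Hugging Face Hub and a local file `invoice.png`; the post doesn't name its exact checkpoint or input file, so treat both as placeholders:

```python
from PIL import Image
import torch
from transformers import AutoImageProcessor, TableTransformerForObjectDetection

image = Image.open("invoice.png").convert("RGB")  # placeholder filename

processor = AutoImageProcessor.from_pretrained("microsoft/table-transformer-detection")
model = TableTransformerForObjectDetection.from_pretrained("microsoft/table-transformer-detection")

inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Turn raw logits/boxes into (score, label, box) results above a confidence threshold.
target_sizes = torch.tensor([image.size[::-1]])  # (height, width)
detections = processor.post_process_object_detection(
    outputs, threshold=0.9, target_sizes=target_sizes
)[0]

# Save each detected table as its own cropped image.
for i, box in enumerate(detections["boxes"]):
    x0, y0, x1, y1 = (int(v) for v in box.tolist())
    image.crop((x0, y0, x1, y1)).save(f"table_{i}.png")
```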
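And a sketch of steps 3 and 4: reading each cell crop with easyOCR (the post also mentions PaddleOCR as an alternative), then whiting out the table region with OpenCV and OCRing whatever text remains on the page. The cell filenames, cell count, and table box below are hypothetical stand-ins for whatever the earlier steps produced:

```python
import cv2
import easyocr

reader = easyocr.Reader(["en"])  # downloads the English model on first run

# Step 3: read text from each cell crop (cell_0.png, ... are placeholder
# names for the images produced by the cell-extraction step).
num_cells = 6  # hypothetical count
cell_texts = [reader.readtext(f"cell_{i}.png", detail=0) for i in range(num_cells)]

# Step 4: paint the detected table region white, then OCR the rest of the page.
page = cv2.imread("invoice.png")
x0, y0, x1, y1 = 50, 200, 580, 520  # hypothetical box from the detection step
cv2.rectangle(page, (x0, y0), (x1, y1), (255, 255, 255), thickness=-1)  # filled rectangle
remaining_text = reader.readtext(page, detail=0)  # detail=0 returns plain strings

print(cell_texts)
print(remaining_text)
```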
1. Why Position Matters in Transformers

Transformers rely on self-attention, which processes tokens in parallel. This means, unlike RNNs, they don't inherently know the order of words. So sentences like "Ravi killed the lion" and "The lion killed Ravi" would look identical to a vanilla Transformer, which is clearly problematic!

🧪 Idea #1: The Naïve Approach

A simple fix would be to add the token's index/position to its embedding vector.

Issues:
- Unbounded values: position IDs can become huge (e.g. 100,000+ in long texts), destabilizing training.
- Discrete steps: sharp jumps between integers disrupt gradient flow.

🧪 Idea #2: Normalize the Position Numbers

What if we divide the position numbers by the sequence length, so they stay small and smooth? That helps a bit: the values no longer explode.

Issues: if you observe both sentences, the word at the second position now gets different values, 1 for sentence 1 and 0.5 for sentence 2. So the neural network gets confused during training, because what a given position value means depends on the sentence length (the toy example below makes this concrete).
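A toy illustration of that ambiguity (my own example, not code from the post), using length-normalized positions where the last token maps to 1.0:

```python
# Length-normalized positions give the same token index different
# values in sentences of different lengths.
def normalized_positions(tokens):
    n = len(tokens)
    return [i / (n - 1) for i in range(n)]  # scale the last index to 1.0

print(normalized_positions(["Ravi", "killed"]))          # [0.0, 1.0]
print(normalized_positions(["Ravi", "killed", "lion"]))  # [0.0, 0.5, 1.0]
# The token at index 1 is encoded as 1.0 in the first sentence but 0.5
# in the second, so the network cannot learn a stable meaning for it.
```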
LangChain is a powerful framework for building applications with Large Language Models (LLMs). One of its core features is the Chain component, which allows developers to link multiple operations together to create complex workflows. In this blog, we'll explore different types of chains in LangChain and demonstrate their usage with code examples.

1️⃣ LLMChain (Basic Single-Step Chain)

🔹 Executes a single LLM call based on a prompt.
🔹 The simplest chain type.
✅ Use Case: When you need a single prompt-response from an LLM.
Example: see the first sketch below.

2️⃣ SequentialChain (Step-by-Step Execution)

🔹 Executes multiple chains in sequence, where each step depends on the output of the previous step.
✅ Use Case: Multi-step processes where each step depends on the previous output.
📌 Example: Blog Outline → Content Expansion → Summary
Example: see the second sketch below.

3️⃣ SimpleSequentialChain (Basic Sequential Execution)

🔹 Similar to SequentialChain, but only passes the last step's output forward (less flexible).
Example: see the third sketch below.
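First, a minimal LLMChain sketch. It assumes the classic (pre-LCEL) LangChain API and an OpenAI key in the environment; the prompt text is my own placeholder:

```python
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate
from langchain.chains import LLMChain

llm = OpenAI(temperature=0.7)  # reads OPENAI_API_KEY from the environment

prompt = PromptTemplate(
    input_variables=["topic"],
    template="Write a one-paragraph introduction about {topic}.",
)

chain = LLMChain(llm=llm, prompt=prompt)
print(chain.run(topic="LangChain"))  # single prompt -> single response
```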
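Next, a SequentialChain sketch for the Outline → Content → Summary pipeline. The prompts and topic are my own placeholders; `llm` is the model from the previous sketch. Each sub-chain gets an `output_key` so later steps, and the caller, can reference intermediate results by name:

```python
from langchain.chains import LLMChain, SequentialChain
from langchain.prompts import PromptTemplate

outline_chain = LLMChain(
    llm=llm,
    prompt=PromptTemplate(
        input_variables=["topic"],
        template="Write a short blog outline about {topic}.",
    ),
    output_key="outline",
)
content_chain = LLMChain(
    llm=llm,
    prompt=PromptTemplate(
        input_variables=["outline"],
        template="Expand this outline into a short blog post:\n{outline}",
    ),
    output_key="content",
)
summary_chain = LLMChain(
    llm=llm,
    prompt=PromptTemplate(
        input_variables=["content"],
        template="Summarize this post in two sentences:\n{content}",
    ),
    output_key="summary",
)

pipeline = SequentialChain(
    chains=[outline_chain, content_chain, summary_chain],
    input_variables=["topic"],
    output_variables=["outline", "content", "summary"],  # expose all steps
)
result = pipeline({"topic": "vector databases"})
print(result["summary"])
```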
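Finally, a SimpleSequentialChain sketch. Here each step must take exactly one input and produce one output, and only the final step's output comes back, which is what makes it less flexible:

```python
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate
from langchain.chains import LLMChain, SimpleSequentialChain

llm = OpenAI(temperature=0.7)

step1 = LLMChain(
    llm=llm,
    prompt=PromptTemplate(
        input_variables=["topic"],
        template="Write a short blog outline about {topic}.",
    ),
)
step2 = LLMChain(
    llm=llm,
    prompt=PromptTemplate(
        input_variables=["outline"],
        template="Summarize this outline in two sentences:\n{outline}",
    ),
)

simple_pipeline = SimpleSequentialChain(chains=[step1, step2], verbose=True)
# Only the final summary is returned; the intermediate outline is discarded.
print(simple_pipeline.run("vector databases"))
```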