Skip to main content

Command Palette

Search for a command to run...

From Full Stack to AI: Core Fundamentals of Generative AI

Understanding Generative AI: Key Fundamentals from Full Stack to AI

Published
4 min read
From Full Stack to AI: Core Fundamentals of Generative AI
P

I’m a full-stack developer who enjoys building practical, scalable applications with React.js, Node.js, and Next.js. My journey into open source started with Hacktoberfest 2023, and it opened the door to real collaboration, learning from global contributors, and supporting early developers as they grow.

Since then, I’ve contributed to and mentored in programs like GSSoC’24, SSOC’24, and C4GT’24. As a Google Gen AI Exchange Hackathon ’24 Finalist and a Google Women Techmakers Ambassador, I’ve had the chance to help communities explore AI and build meaningful solutions. I’m also part of the Top 1% mentors on Topmate, where I guide students on open source, career building, and technical growth.

My work has been featured at Times Square NYC, and I’ve spoken on international podcasts about tech, learning, and community. I’ve also written technical content for CoderArmy and continue to share insights through articles and public posts. LinkedIn has recognized my work with seven Top Voice badges as well as Golden Badges in research, critical thinking, teamwork, and interpersonal skills.

I completed my MCA from Chandigarh University in 2023 and continue to stay curious by exploring AI, building new projects, and contributing to developer communities. Whether it’s improving a UI, debugging backend logic, or helping someone with their first pull request, I enjoy learning alongside others.

If you want to collaborate, learn together, or discuss an idea, feel free to reach out at kumaripayal7488@gmail.com

From a full stack background, AI can feel confusing at first. There are new words, new math, and new ways of thinking. But at its core, generative AI is still software. It takes input, processes it step by step, and gives output. In this chapter, I am writing what I wish I had when I started. A simple, practical explanation of how Large Language Models work under the hood.


1. Understanding Large Language Models (LLMs)

What is an LLM?

LLM stands for Large Language Model. It is a program trained to understand and generate human language. You give it text as input, and it predicts what text should come next.

Think of it like autocomplete on steroids. Instead of predicting the next word in a sentence, it predicts the next token based on everything it has learned.

How does an LLM work at a high level?

  1. Text is converted into tokens

  2. Tokens are converted into numbers

  3. A neural network processes those numbers

  4. The model predicts the next token

  5. Tokens are converted back into text

This loop runs again and again until the final response is formed.

2. Deep Dive into the GPT Architecture

What does GPT mean?

GPT stands for:

  • Generative: It creates new text

  • Pre-trained: It is trained on large text data before you use it

  • Transformer: It is built using the transformer architecture

These models all follow the same core idea:

  • GPT by OpenAI

  • Gemini by Google

  • Claude by Anthropic

  • Mistral by Mistral AI

They differ in size, training data, and optimizations, but the base architecture is similar.

Why transformers matter

Transformers allow models to understand context. They look at all words in a sentence at the same time instead of one by one. This is why modern AI feels much smarter than older models.

3. How LLMs Work Under the Hood

Why do LLMs need GPUs?

LLMs perform a massive number of matrix calculations. CPUs are not designed for this level of parallel math. GPUs are.

When training or running large models:

  • Millions of parameters are updated

  • Large matrices are multiplied

  • Everything must happen fast

That is why GPUs are essential for LLMs.

4. Fundamentals of Tokenization in NLP

What is tokenization?

Tokenization is the process of breaking text into smaller units called tokens. Tokens are not always words. Sometimes they are parts of words.

Example:
Text: "Hey There!"
Tokens might be:

  • "Hey"

  • "There"

  • "!"

Encode and decode

  • Encoding converts text into numbers

  • Decoding converts numbers back into text

This is how models understand language.

5. Implementing a Custom Tokenizer in Python

Python example using tiktoken

import tiktoken

enc = tiktoken.encoding_for_model("gpt-4o")
text = "Hey There! My name is Payal Kumari"

tokens = enc.encode(text)
print("Tokens:", tokens)

# Tokens: [25216, 3274, 0, 3673, 1308, 382, 11961, 280, 81689, 1683]

decoded = enc.decode(tokens)
print("Decoded", decoded)

Output

Tokens: [25216, 3274, 0, 3673, 1308, 382, 11961, 280, 81689, 1683]
Decoded Hey There! My name is Payal Kumari

Explanation

Each number represents a token. The model never sees text directly. It only works with numbers. Decoding converts those numbers back into readable text.

6. The Transformer Breakthrough: Google’s Attention Paper

Transformers were introduced in the paper "Attention Is All You Need" by Google.

Core idea

Instead of processing words one by one, the model pays attention to all words at the same time.

This allows it to understand:

  • Long sentences

  • Relationships between words

  • Context across paragraphs

7. Deep Diving into Vector Embeddings

What are embeddings?

Embeddings are numeric representations of words or sentences. Similar meanings result in similar vectors.

Example:

  • "cat" is closer to "dog"

  • "Paris" is closer to "France"

This allows models to understand meaning, not just text.

Real-world use

  • Search engines

  • Recommendation systems

  • Semantic search

8. Role of Positional Encodings in Transformers

Why position matters

Transformers do not understand word order by default. Positional encoding adds information about where a word appears in a sentence.

Example:

  • "Dog bites man"

  • "Man bites dog"

Same words, different meaning. Positional encoding solves this.

9. Understanding Multi-Head Attention for Rich Context

What is multi-head attention?

Instead of one attention mechanism, transformers use multiple heads.

Each head focuses on a different relationship:

  • Grammar

  • Meaning

  • Long-distance connections

This helps the model understand language more deeply.


Closing Thoughts

Moving from full stack development to AI is not about memorizing everything at once. It is about understanding the flow. Tokens to numbers. Numbers through transformers. Predictions back to text. Build step by step, and the concepts will start clicking.

Documenting my Full Stack → AI journey, step by step.

By Payal Kumari

From Full Stack to AI: Learning in Public

Part 11 of 25

In this series, I share my journey of learning AI and LLM engineering as a Full Stack Developer. From Python basics to real AI apps, this is a learning-in-public series with honest insights from a MERN developer transitioning into AI. By Payal Kumari

Up next

From Full Stack to AI: API Setup and Integration

From Full Stack to AI: API Setup and Integration