Leyton Internal Prep · BCG X Technical Interview · Checklist · DSA · Tonight

Mock Interview
BCG X

// 5 phases · 55 min format · full model answers included

📋 Session · Candidate & Feedback · No session

Candidate name

Date

Target position

Interviewer

Exercise used

Overall rating

Strengths

Areas to improve

Phase 01 · ~8 min

Warm-up — Question Bank

Mental Math

Fermi

Logic

Pick 1 from each category per session. Rotate across mock interviews so candidates don't share answers.

🔢 Mental Math 4 questions · pick 1

Mental Math · M1

Compute 840 ÷ 1200. Show your decomposition step by step — no direct answer.

÷10 both sides: 840/1200 = 84/120

÷12 both sides: 84/120 = 7/10

7/10 = 0.7

Mental Math · M2

Compute 1260 ÷ 4200. Decompose step by step.

÷100 both sides: 1260/4200 = 12.6/42

Simpler: divide by 420 → 3/10

3/10 = 0.3

Alternative path: 1260/4200 = 126/420 = 63/210 = 21/70 = 3/10 = 0.3. Multiple decomposition paths are valid — what matters is showing the work.

Mental Math · M3

Estimate √2700. Walk me through your reasoning using known squares.

Anchor: 50² = 2500, close to 2700.

Try 52: 52² = (50+2)² = 2500 + 200 + 4 = 2704. Off by only 4.

√2700 ≈ 52

Memorize key squares: 40²=1600, 45²=2025, 50²=2500, 55²=3025, 60²=3600.
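
The anchor-and-adjust reasoning above can be written as a first-order correction. This is a minimal sketch; the helper name `estimate_sqrt` and the linear-correction formula are illustrative assumptions, not part of the original model answer.

```python
def estimate_sqrt(n: float, anchor: int) -> float:
    """Estimate sqrt(n) from a known square anchor**2 that is close to n.

    Uses (a + d)^2 ≈ a^2 + 2*a*d, so d ≈ (n - a^2) / (2*a).
    """
    return anchor + (n - anchor * anchor) / (2 * anchor)


print(estimate_sqrt(2700, 50))  # 50 + 200/100 = 52.0
```

The closer the anchor square is to n, the smaller the error of the estimate.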

Mental Math · M4

What is 17% of 350? No calculator — show your breakdown.

10% of 350 = 35

5% of 350 = 17.5

2% of 350 = 7

17% = 10% + 5% + 2% = 35 + 17.5 + 7 = 59.5

Decompose into 10% + 5% + 2% increments. This pattern applies to any percentage question.
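
The same decomposition can be checked in a couple of lines. A small sketch, assuming a hypothetical helper `percent_by_parts` that sums the easy building blocks:

```python
def percent_by_parts(value: float, parts: list[int]) -> float:
    """Sum value * p% for each easy building block p (e.g. 10, 5, 2)."""
    return sum(value * p / 100 for p in parts)


print(percent_by_parts(350, [10, 5, 2]))  # 35 + 17.5 + 7 = 59.5
```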

🌍 Fermi Estimation 4 questions · pick 1

Fermi · F1

How many taxis are there in Casablanca? Walk me through your reasoning.

Population: ~4M inhabitants, ~1M households.

Demand: ~20% of households take one taxi trip per week → 200k trips/week → ~30k trips/day.

Supply: 1 taxi does ~12 trips/day → 30k/12 ≈ 2,500 formal taxis. Add informal sector → ~4,000–5,000 total.

Structure: population → demand → capacity → result. Always sanity-check your number at the end.
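
The population → demand → capacity chain can be written down explicitly so each assumption is visible. The sketch below assumes the 20% share applies to households (that is what yields 200k trips/week); all variable names are illustrative.

```python
households = 1_000_000           # ~4M inhabitants, roughly 4 per household
weekly_share = 0.20              # assumption: share of households taking 1 trip/week
trips_per_taxi_per_day = 12      # capacity of one taxi

trips_per_week = households * weekly_share        # 200,000
trips_per_day = trips_per_week / 7                # ~28,600, rounded to ~30k in the text
formal_taxis = trips_per_day / trips_per_taxi_per_day

print(round(formal_taxis))  # ~2,400, consistent with the ~2,500 estimate
```

Changing any one input shows how sensitive the final number is to that assumption, which is the sanity check the interviewer is looking for.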

Fermi · F2

How many software developers are there in Morocco?

Population: ~37M. Working age (~15–60): ~55% = ~20M people.

Workforce with higher education: ~15% = ~3M. STEM graduates: ~20% of that = ~600k.

CS/IT specifically: ~25% of STEM = ~150k. Active developers (not all find dev jobs): ~60% = ~90,000–100,000.

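Cross-check: Morocco has ~50 universities; ~2,000 IT graduates/year × 20 active years ≈ 40k. That is a floor, since it excludes developers who did not come through a university IT degree. The two paths land within a factor of about two (40k vs 90k), which is acceptable agreement for a Fermi estimate.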

Fermi · F3

How many WhatsApp messages are sent per day in Morocco?

Smartphone users: ~37M population, ~70% smartphone penetration = ~26M users.

WhatsApp adoption: Morocco is one of the highest globally — ~90% of smartphone users = ~23M active users.

Messages per user per day: average ~30 messages/day (mix of personal, family groups, professional).

23M × 30 = ~700 million messages/day.

Fermi · F4

How much data does a Netflix stream generate per hour, and how much total data does Netflix serve globally per day?

Per stream: HD = ~3 GB/hour, 4K = ~7 GB/hour. Use ~3 GB as the average (mix of HD/SD).

Concurrent streams globally: Netflix has ~260M subscribers. Peak concurrency ~15% = ~40M simultaneous streams.

Average watch time: ~2h/subscriber/day. Total: 260M × 2h × 3 GB/h = ~1.5 exabytes/day.

This tests whether the candidate can reason about scale. Netflix has been reported at roughly 15% of global downstream internet traffic, which makes a good sanity-check anchor.

🧩 Logic Puzzles 4 questions · pick 1

Logic · L1

You have 8 identical balls. One is slightly heavier. Using a balance scale, what's the minimum number of weighings to find it?

Split 3 / 3 / 2. Weigh the two groups of 3.

If one side heavier → heavy ball is in that 3. If balanced → in the group of 2.

Second weighing identifies the ball in either case.

Answer: 2 weighings. Key: divide into 3 groups not 2 — log₃ thinking.
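
The log₃ idea generalizes: each weighing has three outcomes (left heavy, right heavy, balanced), so w weighings can distinguish up to 3^w balls. A minimal sketch, with `min_weighings` as a hypothetical helper name:

```python
def min_weighings(n_balls: int) -> int:
    """Smallest w such that 3**w >= n_balls (one heavier ball, balance scale)."""
    weighings, covered = 0, 1
    while covered < n_balls:
        covered *= 3      # one more weighing triples the number of balls we can resolve
        weighings += 1
    return weighings


print(min_weighings(8))  # 2
```

An integer loop is used instead of `math.log` to avoid floating-point rounding at exact powers of 3.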

Logic · L2

You have two ropes. Each takes exactly 60 minutes to burn, but they burn unevenly. How do you measure exactly 45 minutes?

Light rope A from both ends simultaneously, and light rope B from one end.

Rope A finishes in 30 min (burned from both ends). At that moment, light the other end of rope B.

Rope B had 30 min left. Burning from both ends now → it finishes in 15 more min. Total: 30 + 15 = 45 minutes.

Classic BCG puzzle. The insight is: lighting both ends of a rope halves its remaining burn time, regardless of unevenness.

Logic · L3

A bat and a ball cost 1.10€ total. The bat costs 1€ more than the ball. How much does the ball cost?

The intuitive (wrong) answer is 0.10€. The correct answer is 0.05€.

Let ball = x. Bat = x + 1.00.

x + (x + 1.00) = 1.10 → 2x = 0.10 → x = 0.05.

Ball = 0.05€, Bat = 1.05€. Check: 1.05 - 0.05 = 1.00 ✓

This is a System 1 vs System 2 thinking test. BCG uses it to see if the candidate pauses to verify their intuition rather than blurting out the first answer.

Logic · L4

You're in a room with 3 light switches. One controls a bulb in a closed room. You can only enter the room once. How do you identify which switch controls the bulb?

Turn on switch 1 and wait 5 minutes.

Turn switch 1 OFF, turn switch 2 ON.

Enter the room: if the bulb is on → switch 2. If off but warm → switch 1. If off and cold → switch 3.

Exploits a physical property (heat) beyond just on/off. BCG values candidates who use all available information — not just the obvious binary signals.

Phase 02 · ~10 min

Fundamentals — Question Bank

🐳

DevOps / Git

🧱

Data Structures

🤖

AI / LLM

⚙️

Principles

Pick 2–3 questions across different categories per session. Mix categories based on the candidate's CV profile.

🐳 DevOps · Git · Docker 4 questions

DevOps · D1

What is the difference between a Docker Image and a Docker Container?

Image: static, read-only blueprint produced at build time (docker build). Like a class in OOP. Immutable, layered (OS + deps + app code).

Container: a running instance of an image. Like an object instantiated from a class. Has its own writable layer, process space, and network.

Multiple containers can run from the same image simultaneously
Analogy: Image = recipe, Container = dish being cooked
Containers are ephemeral — data is lost on stop unless mounted to a volume

DevOps · D2

Explain CI/CD. What's the difference between Continuous Delivery and Continuous Deployment?

CI (Continuous Integration): every push triggers automated tests + build. Catches integration issues early. Tools: GitHub Actions, Jenkins, CircleCI.

Continuous Delivery: build is always deployable, but a human approves the push to prod
Continuous Deployment: every green build auto-deploys to prod — no human gate

BCG context: a solid CI/CD pipeline reduces deployment risk and enables the rapid iteration speed BCG X expects.

DevOps · D3

What is the difference between git merge and git rebase? When would you use each?

git merge creates a merge commit that joins two branch histories. Original history preserved.

git rebase replays your commits on top of the target branch, rewriting history to appear linear.

	merge	rebase
History	Preserves divergence	Linear / cleaner
Commits	Adds a merge commit	Rewrites commits (new SHAs)
Use when	Team branches, PRs	Local cleanup before push

Never rebase a branch already pushed and shared — you rewrite SHAs and break other devs' local copies.

DevOps · D4

What does git reset --hard do vs git revert? Which is safe on a shared branch?

git reset --hard HEAD~1 — moves the branch pointer back, discards the commit. Destructive. Rewrites history.

git revert <SHA> — creates a new commit that undoes a previous commit. Non-destructive. History preserved.

reset --hard → safe only on local, unpublished commits
revert → always safe, including on shared/main branches
In production: always use revert — it's auditable and reversible

🧱 Data Structures 4 questions

DS · S1

What is the difference between a List, a Set, and a Dictionary in Python? When would you choose each?

Structure	Ordered	Duplicates	Lookup	Use case
List	✅	✅	O(n)	Ordered sequence, indexed access
Set	❌	❌	O(1)	Membership test, deduplication
Dict	✅ (3.7+)	Keys: ❌	O(1)	Key→value mapping, frequency count

# List — ordered, indexable, allows duplicates
temps = [3, -5, 3, 0]

# Set — fast membership, no order, no dupes
visited = {"nodeA", "nodeB"}

# Dict — key→value, O(1) lookup
freq = {"python": 3, "data": 2}

DS · S2

What is the difference between a HashMap and a HashSet? How are they implemented internally?

HashMap stores key → value pairs. HashSet stores only keys — internally it's a HashMap where the value is a dummy constant.

	HashMap	HashSet
Stores	key + value	key only
Lookup	O(1) avg	O(1) avg
Use when	Count, cache, index	Visited, dedup, membership

Both use a hash table. Hash function → bucket index. Collisions: chaining (linked list per bucket) or open addressing (Python uses open addressing with pseudo-random probing).

Dict lookup is O(1) average, O(n) worst case with hash collisions — but negligible in practice with Python's built-in types.
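
To make the "hash function → bucket index → collision handling" description concrete, here is a toy hash map using separate chaining. It is a teaching sketch only: the class name `ChainedHashMap` and the fixed bucket count are assumptions, and Python's real dict uses open addressing rather than chaining.

```python
class ChainedHashMap:
    """Toy hash map using separate chaining (fixed bucket count, no resizing)."""

    def __init__(self, n_buckets: int = 8):
        self.buckets = [[] for _ in range(n_buckets)]

    def _bucket(self, key):
        return self.buckets[hash(key) % len(self.buckets)]

    def put(self, key, value):
        bucket = self._bucket(key)
        for i, (k, _) in enumerate(bucket):
            if k == key:
                bucket[i] = (key, value)   # overwrite existing key
                return
        bucket.append((key, value))        # new key (or collision): extend the chain

    def get(self, key, default=None):
        for k, v in self._bucket(key):
            if k == key:
                return v
        return default


m = ChainedHashMap(n_buckets=2)   # tiny table forces collisions
for word in ["python", "data", "python"]:
    m.put(word, m.get(word, 0) + 1)
print(m.get("python"), m.get("data"), m.get("missing"))  # 2 1 None
```

With only two buckets the chains grow long, which illustrates the O(n) worst case; a real implementation resizes to keep chains short.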

DS · S3

What is the difference between a Stack and a Queue? Implement a Stack using only two Queues.

Stack — LIFO. push/pop from same end. Use: undo/redo, call stack, bracket validation.

Queue — FIFO. enqueue at back, dequeue from front. Use: BFS, task scheduling, Kafka.

from collections import deque

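class StackFromQueues:
    # One-queue variant: the question allows two queues, but rotating a
    # single queue after each push already keeps the newest item at the front.
    def __init__(self):
        self.q = deque()

    def push(self, x):
        self.q.append(x)
        for _ in range(len(self.q) - 1):
            self.q.append(self.q.popleft())  # rotate newest to front

    def pop(self):
        return self.q.popleft()  # O(1); raises IndexError on an empty stack
# push O(n), pop O(1)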
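
Since the question explicitly asks for two queues, a candidate may be expected to show that variant as well. A minimal sketch, assuming a hypothetical class name `TwoQueueStack` and using only `append`/`popleft` so both deques behave as plain FIFO queues:

```python
from collections import deque


class TwoQueueStack:
    """Stack built from two FIFO queues; push is O(n), pop is O(1)."""

    def __init__(self):
        self.main = deque()    # front of this queue is always the stack top
        self.helper = deque()

    def push(self, x):
        self.helper.append(x)                        # newest item goes first
        while self.main:
            self.helper.append(self.main.popleft())  # older items queue up behind it
        self.main, self.helper = self.helper, self.main

    def pop(self):
        return self.main.popleft()                   # raises IndexError if empty


s = TwoQueueStack()
for item in (1, 2, 3):
    s.push(item)
print(s.pop(), s.pop(), s.pop())  # 3 2 1
```

The alternative design makes push O(1) and pop O(n); which one to choose depends on whether the workload is push-heavy or pop-heavy.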

DS · S4

What is the difference between an Array and a Linked List? When is each preferred?

Operation	Array	Linked List
Index access	O(1)	O(n)
Search	O(n)	O(n)
Insert at head	O(n) shift	O(1)
Insert at tail	O(1) amortized	O(n) / O(1) with tail ptr
Memory	Contiguous, cache-friendly	Scattered, pointer overhead

Python's list is a dynamic array, not a linked list. Use collections.deque for O(1) head insertions.
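
A short illustration of that last point, using only the standard library:

```python
from collections import deque

items = [2, 3]
items.insert(0, 1)        # dynamic array: every element shifts right, O(n)

queue = deque([2, 3])
queue.appendleft(1)       # deque: O(1) insertion at either end

print(items, list(queue))  # [1, 2, 3] [1, 2, 3]
```

Both produce the same sequence; the difference only shows up in running time as the collection grows.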

🤖 AI · LLM · LangChain · RAG · Agents 5 questions

AI · A1

What is LangChain and what problem does it solve vs calling the OpenAI API directly?

LangChain is an orchestration framework for LLM-powered apps. Direct API = single prompt → single response. LangChain adds: Chains (multi-step pipelines), Agents (LLM-driven tool selection), Memory (conversation history), Tools (external integrations), Prompt templates.

Analogy: direct API = raw SQL. LangChain = ORM. Higher abstraction, faster to build, less fine-grained control.

AI · A2

Explain the ReAct pattern. What does the loop look like and why is it powerful?

ReAct (Reasoning + Acting): the LLM alternates Thought → Action → Observation until it can produce a Final Answer.

Thought: "I need to find the population of Casablanca"

Action: search("Casablanca population 2024")

Observation: "4.27 million"

Final Answer: generated from accumulated observations

Loop stops at max_iterations to prevent infinite loops
Agent chooses tools based on their text descriptions — no hardcoded flow
Powerful because it's dynamic: the LLM decides what info it needs at runtime
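
The Thought → Action → Observation loop can be sketched without any framework. Everything below is hypothetical scaffolding: `fake_llm` is a scripted stand-in for a real model call, `search` is a dummy tool, and the `Action: tool[input]` text format is an assumed convention, not a specific library's API.

```python
def fake_llm(history: list[str]) -> str:
    """Stand-in for a real model call: scripted so the sketch runs offline."""
    if not any(line.startswith("Observation:") for line in history):
        return "Action: search[Casablanca population]"
    return "Final Answer: about 4.27 million"


def search(query: str) -> str:
    """Hypothetical tool; a real agent would call an external API here."""
    return "4.27 million"


TOOLS = {"search": search}


def react(question: str, max_iterations: int = 5) -> str:
    history = [f"Question: {question}"]
    for _ in range(max_iterations):            # hard cap prevents infinite loops
        step = fake_llm(history)
        history.append(step)
        if step.startswith("Final Answer:"):
            return step.removeprefix("Final Answer:").strip()
        # Parse "Action: tool[input]" and run the chosen tool
        name, _, arg = step.removeprefix("Action:").strip().partition("[")
        history.append(f"Observation: {TOOLS[name](arg.rstrip(']'))}")
    return "stopped: max_iterations reached"


print(react("What is the population of Casablanca?"))  # about 4.27 million
```

The tool is chosen by name from the model's text output, which is why clear tool descriptions matter more than any hardcoded control flow.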

AI · A3

What is RAG? Describe the pipeline and name 3 limitations.

RAG retrieves relevant docs at query time and injects them into the LLM prompt — solving the knowledge cutoff problem.

Index (offline): chunk → embed → store in vector DB (FAISS, Pinecone, Chroma)

Retrieve (online): embed query → cosine similarity → top-k chunks

Generate: LLM answers using retrieved chunks as context

Chunk boundary splits lose cross-sentence context
Wrong retrieval → hallucination (garbage in, garbage out)
Context window limit caps how many chunks can be injected
Index staleness when source documents update
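
The retrieve step of the pipeline above can be sketched with a toy embedding. This is an assumption-heavy illustration: `embed` uses bag-of-words counts in place of a real embedding model, and a plain list stands in for the vector DB.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy 'embedding': bag-of-words counts (a real system would use a model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]


chunks = [
    "docker images are read-only blueprints",
    "containers are running instances of images",
    "git rebase rewrites commit history",
]
top = retrieve("what is git rebase", chunks, k=1)
prompt = f"Context: {top[0]}\nQuestion: what is git rebase"  # what the LLM would receive
print(top)  # ['git rebase rewrites commit history']
```

If the wrong chunk ranks first, the prompt carries the wrong context, which is exactly the "wrong retrieval → hallucination" limitation listed above.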

"RAG replaces BM25 sparse retrieval with dense vector similarity — same architecture, different distance metric."

AI · A4

What is the difference between an LLM Agent and a simple LLM Chain? Give a concrete production example.

	Chain	Agent
Flow	Fixed at code time	LLM decides at runtime
Latency	Lower	Higher (multi-step)
Reliability	High	Variable
Use when	Known, repeatable task	Open-ended, multi-hop reasoning

Example: data analyst agent — "Why did revenue drop in Q3?" → agent decides to: query DB, run correlation, search external news, then synthesize. A chain can't do this because the steps are unknown in advance.

AI · A5

What is TF-IDF, how does BM25 improve it, and why does log(1) = 0 matter in the formula?

TF-IDF: TF(t,d) × log(N / (1 + df(t))). Scores term importance by frequency × rarity.

BM25 adds: (1) TF saturation — diminishing returns on repeated terms. (2) Length normalization — penalizes long documents.

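Unsmoothed IDF = log(N / df): a term in every doc gives log(N/N) = log(1) = 0 → IDF = 0. Correct: it behaves like a stopword ("the", "is")
df = 0 → N / 0 is a division by zero, so the +1 in the denominator guards against it. Side effect: with the +1, a term in every doc scores log(N / (N + 1)), slightly below 0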

BCG often shows the BM25 formula and asks candidates to spot edge cases — the +1 guard is one of them.
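
The edge cases are easy to verify numerically. A minimal sketch with two hypothetical helpers, `idf_raw` for log(N / df) and `idf_smoothed` for the +1-guarded form log(N / (1 + df)):

```python
import math


def idf_smoothed(n_docs: int, df: int) -> float:
    """IDF with the +1 guard: log(N / (1 + df))."""
    return math.log(n_docs / (1 + df))


def idf_raw(n_docs: int, df: int) -> float:
    """Unsmoothed IDF: log(N / df). Fails when df == 0."""
    return math.log(n_docs / df)


N = 1000
print(idf_raw(N, N))        # 0.0: a term in every document carries no signal
print(idf_smoothed(N, 0))   # log(1000) ≈ 6.9: an unseen term is handled safely
print(idf_smoothed(N, N))   # slightly negative: log(1000/1001)
```

Calling `idf_raw(N, 0)` raises `ZeroDivisionError`, which is the failure the guard exists to prevent.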

⚙️ SOLID · ACID · REST vs GraphQL · SQL vs NoSQL 4 questions

Principles · P1

What are the SOLID principles? One sentence each + one concrete code example.

Letter	Principle	One-liner
S	Single Responsibility	A class has only one reason to change
O	Open/Closed	Open for extension, closed for modification
L	Liskov Substitution	Subclasses must be usable wherever the parent is used
I	Interface Segregation	Don't force classes to implement interfaces they don't use
D	Dependency Inversion	Depend on abstractions, not concrete implementations

# SRP violation — one class doing too much
class Report:
    def generate(self): ...
    def save_to_db(self): ...  # ← different responsibility
    def send_email(self): ...  # ← different responsibility

# SRP fix — one class per responsibility
class ReportGenerator:
    def generate(self): ...

class ReportRepository:
    def save(self, r): ...

class ReportNotifier:
    def send(self, r): ...

Principles · P2

What are the ACID properties? Give a real-world violation example for each.

Property	Meaning	Violation example
Atomicity	All or nothing	Bank transfer: debit succeeds, credit crashes → money disappears
Consistency	DB stays in valid state	Order references a deleted customer → orphaned FK record
Isolation	Concurrent txns don't interfere	Dirty read: user sees another's uncommitted changes
Durability	Committed data survives crashes	Power cut after commit — data gone if not persisted to disk

ACID = SQL databases (PostgreSQL, MySQL). NoSQL often trades ACID for availability + partition tolerance (CAP theorem).
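
Atomicity is the easiest property to demonstrate live. The sketch below uses the standard-library `sqlite3` module with an in-memory database; the `accounts` table and the simulated crash are illustrative assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance INTEGER)")
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 0)])
conn.commit()

try:
    with conn:  # commits on success, rolls back if an exception escapes
        conn.execute("UPDATE accounts SET balance = balance - 50 WHERE name = 'alice'")
        raise RuntimeError("simulated crash before the credit step")
except RuntimeError:
    pass

balances = dict(conn.execute("SELECT name, balance FROM accounts"))
print(balances)  # {'alice': 100, 'bob': 0}: the debit was rolled back
```

Without the transaction, the debit would persist while the credit never happened, which is the "money disappears" violation from the table.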

Principles · P3

What is the difference between SQL and NoSQL? When would you choose each in a BCG X context?

	SQL	NoSQL
Schema	Fixed, enforced	Flexible / schema-less
Scaling	Vertical	Horizontal
Transactions	Full ACID	Eventual consistency
Examples	PostgreSQL, MySQL	MongoDB, Redis, Cassandra

SQL: financial data, structured reporting, complex joins, compliance
NoSQL: high-volume writes, flexible schemas, caching (Redis), real-time feeds, vector DBs

BCG X example: ESG platform uses PostgreSQL for structured company data, Redis to cache API responses, and FAISS (vector DB) for the RAG layer.

Principles · P4

What is the difference between REST and GraphQL? What problem does GraphQL solve?

REST — resource-based endpoints returning fixed shapes. Problems: over-fetching (too many fields) and under-fetching (need multiple requests).

GraphQL — single endpoint, client defines exactly which fields it needs.

	REST	GraphQL
Endpoints	Many (/users, /posts…)	One (/graphql)
Over-fetching	Common	Eliminated
Type system	Optional (OpenAPI)	Built-in schema
Caching	Easy (HTTP)	Harder (POST requests)
Best for	Simple CRUD APIs	Complex nested data, mobile

Phase 03 · ~20 min

Live Coding — Problem Solving

Exercises: pick 1 per candidate

Always follow this sequence: Clarify assumptions → Brute force → Optimize → Complexity → Edge cases → Narrate. Magic phrase: "Let me first start with a straightforward approach, then I'll optimize it."

Problem A · Top K Frequent Elements

Medium

Given an array of integers nums and an integer k, return the k most frequent elements. The result must be sorted in descending order of frequency. If two elements have the same frequency, sort them in ascending order of value.

Example: nums = [1,1,1,2,2,3], k = 2 → [1, 2]
Example: nums = [4,4,2,2,3], k = 2 → [2, 4] (tie in freq → ascending value)

Step 1 — Clarify Assumptions

Is k always valid? (1 ≤ k ≤ number of unique elements) → assume yes
Can there be negative numbers? → yes, hashmap handles it fine
What if multiple elements tie in frequency? → sort them ascending by value (good edge case to raise proactively)
Should the output preserve original order? → no, sort by frequency descending

Step 2 — Brute Force

Count frequencies naively with a loop, then sort all unique elements by frequency, take the first k.

def top_k_brute(nums, k):
    freq = {}
    for n in nums:
        freq[n] = freq.get(n, 0) + 1  # build manually

    # sort all unique elements by freq descending
    sorted_keys = sorted(freq.keys(),
                          key=lambda x: -freq[x])
    return sorted_keys[:k]
# Time: O(n + u log u)  u = unique elements  Space: O(u)

Step 3 — Optimize: HashMap + Sorted with Tuple Key

Use collections.Counter to build the frequency map in O(n), then sort with a tuple key to handle ties — (-freq, value) sorts by frequency descending, then value ascending.

from collections import Counter

def top_k_frequent(nums: list[int], k: int) -> list[int]:
    # Step 1: build frequency map — O(n)
    freq = Counter(nums)
    # freq = {1: 3, 2: 2, 3: 1} for [1,1,1,2,2,3]

    # Step 2: sort by (-freq, value) — handles ties cleanly
    sorted_elements = sorted(freq.keys(),
                              key=lambda x: (-freq[x], x))

    # Step 3: return top k
    return sorted_elements[:k]
# Time: O(n + u log u)  Space: O(u)

Step 4 — Dry Run

nums = [4,4,2,2,3], k = 2

freq = {4: 2, 2: 2, 3: 1}

sort key (-freq, value):
  4 → (-2, 4)
  2 → (-2, 2)
  3 → (-1, 3)

sorted: [2, 4, 3]   ← 2 before 4 because (-2,2) < (-2,4)

return [:2] → [2, 4]  ✓  (tie broken by ascending value)

Step 5 — Complexity

Approach	Time	Space	Notes
Brute force (manual loop)	O(n + u log u)	O(u)	Same asymptotically but no tie handling
Counter + sorted (optimal)	O(n + u log u)	O(u)	Clean, handles ties, Pythonic
Heap (most_common)	O(n + u log k)	O(u)	Faster when k << u

u = number of unique elements. In the worst case u = n (all elements distinct).
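
The heap row of the table can be sketched with the standard library's `heapq.nsmallest`, which accepts a key function and so keeps the tie-breaking rule. The function name `top_k_nsmallest` is illustrative.

```python
import heapq
from collections import Counter


def top_k_nsmallest(nums: list[int], k: int) -> list[int]:
    freq = Counter(nums)
    # Smallest (-freq, value) keys = highest frequency first, lowest value on ties
    return heapq.nsmallest(k, freq, key=lambda x: (-freq[x], x))


print(top_k_nsmallest([4, 4, 2, 2, 3], 2))  # [2, 4]
```

The result comes back already ordered by the key, so no extra sort is needed.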

Step 6 — Edge Cases

k = 1 → returns the single most frequent element ✓
All elements identical → freq = {x: n}, k = 1 → [x] ✓
All elements unique → every freq = 1, sort ascending by value, return first k ✓
Negative numbers → hashmap handles, tuple sort still works ✓

Interviewer Follow-up Questions

Follow-up 1

You used sorted() which is O(u log u). Can you do better when k is much smaller than u?

Yes — use a min-heap of size k. Push elements one by one; if the heap exceeds k, pop the minimum. Final heap contains the top k.

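import heapq
from collections import Counter

def top_k_heap(nums, k):
    freq = Counter(nums)
    # min-heap of (freq, -value): the root is the least frequent element,
    # and among equal frequencies the largest value (lowest priority on ties)
    heap = []
    for val, cnt in freq.items():
        heapq.heappush(heap, (cnt, -val))
        if len(heap) > k:
            heapq.heappop(heap)  # evict the current worst candidate
    # descending (freq, -value) = frequency descending, value ascending
    return [-neg for cnt, neg in sorted(heap, reverse=True)]
# Time: O(n + u log k), better when k << u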

Method	Time	Best when
sorted()	O(u log u)	k close to u
Min-heap	O(u log k)	k << u

Follow-up 2

How would you sort the result in ascending order of frequency instead (least frequent first)?

Simply remove the minus sign from the sort key — (freq[x], x) instead of (-freq[x], x).

# Ascending frequency, ascending value on tie
sorted_elements = sorted(freq.keys(),
                          key=lambda x: (freq[x], x))

# Descending frequency, descending value on tie
sorted_elements = sorted(freq.keys(),
                          key=lambda x: (-freq[x], -x))

Tuple sort keys are the cleanest way to handle multi-level sorting in Python. Master this pattern — it comes up constantly.

Follow-up 3

What does Counter actually do internally? What's the complexity of building it?

Counter is a subclass of dict. It iterates through the list once and increments a count for each element — exactly equivalent to a manual hashmap loop.

Time: O(n) — one pass through the input
Space: O(u) — one entry per unique element
Each increment is O(1) average (hash table access)

Saying "Counter is O(n) because it's a single-pass hashmap" signals that you understand what's happening under the hood — exactly what BCG probes.


Problem B · Closest Temperature to Zero

Easy–Medium

Given a list of non-zero integers representing temperatures, return the value closest to zero. If two values are equally close (e.g. -5 and 5), return the positive one. If the list is empty, return 0.

Example 1: [1, -2, -8, 4, 5] → 1
Example 2: [-5, 5, 8] → 5 (tie → positive wins)
Example 3: [] → 0
Example 4: [-1] → -1
Example 5: [0, -1, 2] → ❌ problem says non-zero — raise this!

Step 1 — Clarify Assumptions

Can the list be empty? → yes, return 0
Can there be zeros in the list? → problem says non-zero, but worth asking — if yes, 0 is trivially the closest
Tie-breaking rule: if two values have the same absolute value, return the positive one
What's the return type? Same type as input (int)
Single element? → return it directly

Step 2 — Brute Force

Sort by absolute value, pick the first. But sorting loses control over the tie-breaking rule — need to handle that explicitly.

def closest_to_zero_brute(temps: list[int]) -> int:
    if not temps: return 0
    # note: list.sort() reorders the caller's list in place
    temps.sort(key=lambda x: (abs(x), -x))  # abs asc, positive first on tie
    return temps[0]
# Time: O(n log n)  Space: O(1)

Step 3 — Optimize: Single Pass O(n)

Track the best candidate. Update when we find a strictly closer value, or an equally close but positive value.

def closest_to_zero(temps: list[int]) -> int:
    if not temps:
        return 0

    closest = temps[0]

    for t in temps[1:]:
        # strictly closer → update
        if abs(t) < abs(closest):
            closest = t
        # equally close → prefer positive
        elif abs(t) == abs(closest) and t > 0:
            closest = t

    return closest
# Time: O(n)  Space: O(1)

Step 4 — Dry Run

temps = [-5, 5, 3]

closest = -5

t = 5: abs(5)==abs(-5) and 5>0 → closest = 5
t = 3: abs(3) < abs(5)         → closest = 3

return 3  ✓

---
temps = [-5, 5]

closest = -5
t = 5: abs(5)==abs(-5) and 5>0 → closest = 5

return 5  ✓  (tie → positive)

Step 5 — Complexity

Approach	Time	Space
Sort-based	O(n log n)	O(1)
Single pass (optimal)	O(n)	O(1)

Step 6 — Edge Cases

Empty list → return 0 ✓
Single element → returned immediately ✓
All negative → returns the one with smallest absolute value ✓
Tie between -x and +x → positive wins via elif t > 0 ✓
Multiple ties (e.g. [-3, 3, -3, 3]) → closest becomes 3 at the first positive, the later -3 is ignored, and the last 3 reassigns the same value → still 3 ✓

The tie-breaking condition elif abs(t) == abs(closest) and t > 0 is the most common mistake — candidates forget it or use wrong operator. Proactively mentioning it before coding is a strong signal.
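
For comparison, the same tie-breaking rule can be expressed as a sort key passed to the built-in `min`, which stays O(n). This is an alternative sketch, not the expected interview answer; the name `closest_to_zero_min` is illustrative.

```python
def closest_to_zero_min(temps: list[int]) -> int:
    # Key compares distance to zero first, then prefers the positive value on a tie.
    return min(temps, key=lambda t: (abs(t), -t), default=0)


print(closest_to_zero_min([-5, 5]))  # 5
```

The explicit loop is still worth writing first in an interview, since it makes the tie-breaking condition visible and easy to narrate.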

Problem C · Valid Anagram

Easy

Given two strings s1 and s2, determine whether they are anagrams of each other. Two strings are anagrams if they contain the same characters with the same frequencies, ignoring spaces and case. Return a boolean.

Example 1: "listen" / "silent" → True
Example 2: "bad credit" / "debit card" → True (spaces ignored)
Example 3: "hello" / "world" → False
Example 4: "" / "" → True

Step 1 — Clarify Assumptions

Should spaces be ignored? → yes (BCG example: "bad credit" / "debit card")
Case-sensitive? → no, lowercase before comparing
Only ASCII letters? → assume yes
Empty strings: are they anagrams of each other? → yes (both empty after stripping)

Step 2 — Brute Force (sorted)

Sort both cleaned strings and compare. Simple but O(n log n).

def is_anagram_sort(s1: str, s2: str) -> bool:
    clean = lambda s: sorted(s.lower().replace(" ", ""))
    return clean(s1) == clean(s2)
# Time: O(n log n)  Space: O(n)

One-liner is elegant but BCG wants you to explain it step by step first, then show this as the compact version.

Step 3 — Optimize: HashMap O(n)

Build a frequency map for s1, decrement for s2. If all counts reach 0, they're anagrams.

def is_anagram(s1: str, s2: str) -> bool:
    # normalize: lowercase, remove spaces
    s1 = s1.lower().replace(" ", "")
    s2 = s2.lower().replace(" ", "")

    if len(s1) != len(s2):
        return False  # early exit

    count = {}
    for ch in s1:
        count[ch] = count.get(ch, 0) + 1

    for ch in s2:
        count[ch] = count.get(ch, 0) - 1
        if count[ch] < 0:
            return False  # s2 has more of this char than s1

    return True
# Time: O(n)  Space: O(1) — bounded by 26 letters

Step 4 — Dry Run

s1 = "bad credit" → "badcredit"
s2 = "debit card" → "debitcard"

Build count from s1: {b:1, a:1, d:2, c:1, r:1, e:1, i:1, t:1}

Decrement with s2:
  d→1, e→0, b→0, i→0, t→0, c→0, a→0, r→0, d→0

All counts ≥ 0 → True ✓

Step 5 — Complexity

Approach	Time	Space
Sorted comparison	O(n log n)	O(n)
HashMap (optimal)	O(n)	O(1)*

*O(1) because alphabet is fixed size (26). O(k) if unicode.

Step 6 — Edge Cases

Different lengths after normalization → early return False ✓
Empty strings → len check passes (0==0), both loops skip → True ✓
"bad credit" / "debit card" → spaces stripped before comparison ✓
Same string twice → always True ✓
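
A compact variant of the frequency-map idea uses `collections.Counter` and compares the two maps directly. A minimal sketch, with `is_anagram_counter` as an illustrative name:

```python
from collections import Counter


def is_anagram_counter(s1: str, s2: str) -> bool:
    def normalize(s: str) -> str:
        return s.lower().replace(" ", "")

    # Two strings are anagrams exactly when their character counts match
    return Counter(normalize(s1)) == Counter(normalize(s2))


print(is_anagram_counter("bad credit", "debit card"))  # True
```

It has the same O(n) time as the manual loop but gives up the early exit on the first surplus character.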

Problem D · Two Sum

Easy

Given an array of integers nums and a target integer target, return the indices of the two numbers that add up to target. Each input has exactly one valid solution. You may not use the same element twice.

Example 1: nums=[2,7,11,15], target=9 → [0,1]
Example 2: nums=[3,2,4], target=6 → [1,2]
Example 3: nums=[3,3], target=6 → [0,1]

Step 1 — Clarify Assumptions

Return indices or values? → indices
Can the same element be used twice? → no (but two different indices with same value are OK — e.g. [3,3])
Is the array sorted? → do not assume
Exactly one solution guaranteed → no need to handle "not found"

Step 2 — Brute Force

def two_sum_brute(nums, target):
    for i in range(len(nums)):
        for j in range(i+1, len(nums)):
            if nums[i] + nums[j] == target:
                return [i, j]
# Time: O(n²)  Space: O(1)

Step 3 — Optimize: HashMap O(n)

For each number, check if its complement (target - num) is already in the hashmap. Store value → index as we go.

def two_sum(nums: list[int], target: int) -> list[int]:
    seen = {}  # value → index

    for i, num in enumerate(nums):
        complement = target - num
        if complement in seen:
            return [seen[complement], i]
        seen[num] = i  # store AFTER check to avoid self-use

# Time: O(n)  Space: O(n)

Step 4 — Dry Run

nums=[3,2,4], target=6

i=0, num=3: complement=3, not in seen={} → seen={3:0}
i=1, num=2: complement=4, not in seen={3:0} → seen={3:0,2:1}
i=2, num=4: complement=2, 2 in seen → return [1, 2] ✓

Step 5 — Complexity

Approach	Time	Space
Brute force (nested loops)	O(n²)	O(1)
HashMap one-pass	O(n)	O(n)

Step 6 — Edge Cases

[3,3] target=6 → i=0 stores 3:0, i=1 complement=3 found at index 0 → [0,1] ✓ (store after check prevents self-use)
Negative numbers → complement still works (e.g. target=0, num=-3 → complement=3) ✓
Single element → no valid pair, but problem guarantees one exists ✓

The "store AFTER check" detail is the classic trap. Asking about [3,3] target=6 proactively is a very strong signal to the interviewer.

Problem E · Merge Intervals

Medium

Given an array of intervals [[start, end], ...], merge all overlapping intervals and return the result sorted by start time in ascending order.

Example 1: [[1,3],[2,6],[8,10],[15,18]] → [[1,6],[8,10],[15,18]]
Example 2: [[1,4],[4,5]] → [[1,5]] (touching = overlapping)
Example 3: [[1,4],[2,3]] → [[1,4]] (contained inside)
Example 4: [] → []

Step 1 — Clarify Assumptions

Is [1,4] and [4,5] overlapping? → yes, touching counts (end == start)
Is [1,4] and [2,3] one interval? → yes, fully contained → merge to [1,4]
Are intervals sorted on input? → do not assume, sort first
Can the list be empty? → yes, return []

Step 2 — Brute Force sketch

Compare every pair of intervals. O(n²) — too slow for large inputs. Mention and move on.

Step 3 — Optimal: Sort + Linear Merge

Sort by start time. Iterate and either extend the current interval or start a new one.

def merge_intervals(intervals: list[list[int]]) -> list[list[int]]:
    if not intervals:
        return []

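    # Sort ascending by start: O(n log n). sorted() leaves the caller's list order untouched
    ordered = sorted(intervals, key=lambda x: x[0])

    merged = [list(ordered[0])]  # copy so the caller's inner list is not mutated

    for start, end in ordered[1:]: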
        last_end = merged[-1][1]

        if start <= last_end:
            # overlapping → extend the current interval
            merged[-1][1] = max(last_end, end)
        else:
            # gap → start a new interval
            merged.append([start, end])

    return merged
# Time: O(n log n)  Space: O(n)

Step 4 — Dry Run

intervals = [[1,3],[2,6],[8,10],[15,18]]

After sort: [[1,3],[2,6],[8,10],[15,18]]  (already sorted)

merged = [[1,3]]

[2,6]: 2 ≤ 3 → overlap → max(3,6)=6 → merged=[[1,6]]
[8,10]: 8 > 6 → new → merged=[[1,6],[8,10]]
[15,18]: 15 > 10 → new → merged=[[1,6],[8,10],[15,18]]

return [[1,6],[8,10],[15,18]] ✓

---
Edge: [[1,4],[2,3]]
After sort: [[1,4],[2,3]]
[2,3]: 2 ≤ 4 → max(4,3)=4 → merged=[[1,4]] ✓  (contained)

Step 5 — Complexity

Step	Time	Space
Sort	O(n log n)	O(n)*
Linear merge pass	O(n)	O(n) output
Total	O(n log n)	O(n)

*Python's sort (Timsort) can need up to O(n) temporary space in the worst case.

Step 6 — Edge Cases

Empty input → return [] ✓
Single interval → returned as-is ✓
[1,4],[4,5] → 4 ≤ 4 → merged to [1,5] ✓ (touching)
[1,4],[2,3] → contained → max(4,3)=4 → [1,4] ✓
All overlapping → one big interval ✓
None overlapping → original list returned ✓

The max(last_end, end) is critical — without it, a fully contained interval like [2,3] inside [1,4] would incorrectly shrink the result to [1,3].
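
The effect of dropping `max` is easy to show in isolation. The helper below is a hypothetical illustration that merges one overlapping interval into the previous one, with and without the guard:

```python
def merge_last(last: list[int], nxt: list[int], use_max: bool) -> list[int]:
    """Merge nxt into last, assuming they overlap and last starts first."""
    new_end = max(last[1], nxt[1]) if use_max else nxt[1]
    return [last[0], new_end]


print(merge_last([1, 4], [2, 3], use_max=True))   # [1, 4]
print(merge_last([1, 4], [2, 3], use_max=False))  # [1, 3]: the interval shrank
```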

Problem F · Prime Factorization

Medium

Write a function that takes a positive integer n and returns its prime factorization as a sorted list of prime factors.

Example: 12 → [2, 2, 3]
Example: 60 → [2, 2, 3, 5]
Example: 13 → [13] (already prime)

Step 1 — Clarify Assumptions

Can I assume n is a positive integer greater than 1? → yes
Return a list of factors or a formatted string like "2 × 2 × 3"? → list (easier to test)
Should the list be sorted? → yes, naturally sorted by the algorithm
What about n = 1? → 1 has no prime factors → return [] (good edge case to raise)

Step 2 — Brute Force Intuition

Try every divisor from 2 up to n. For each one, divide as many times as possible. Works but is O(n) in the worst case (e.g. n is prime).
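
A minimal sketch of that brute-force idea, useful as the "straightforward approach" to state before optimizing (the function name is illustrative):

```python
def prime_factorization_brute(n: int) -> list[int]:
    factors = []
    divisor = 2
    while n > 1:                    # runs until everything is divided out
        while n % divisor == 0:
            factors.append(divisor)
            n //= divisor
        divisor += 1                # may climb all the way to n when n is prime
    return factors


print(prime_factorization_brute(60))  # [2, 2, 3, 5]
```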

Step 3 — Optimize: Stop at √n

Key insight: if n has no prime factor ≤ √n, then n itself is prime. So we only need to test divisors up to √n — which gives O(√n) time.

def prime_factorization(n: int) -> list[int]:
    factors = []
    divisor = 2

    while divisor * divisor <= n:  # only up to √n
        while n % divisor == 0:     # divide out all copies
            factors.append(divisor)
            n //= divisor
        divisor += 1

    if n > 1:                      # remaining n is prime
        factors.append(n)

    return factors
# Time: O(√n)  Space: O(k) — k = number of prime factors

Step 4 — Dry Run

n = 60

divisor=2: 60%2==0 → append 2, n=30
           30%2==0 → append 2, n=15
           15%2≠0 → stop
divisor=3: 15%3==0 → append 3, n=5
           5%3≠0  → stop
divisor=4: 4*4=16 > 5 → loop exits

n=5 > 1 → append 5

return [2, 2, 3, 5] ✓

---
n = 13 (prime)

divisor=2: 2*2=4 ≤ 13, 13%2≠0
divisor=3: 3*3=9 ≤ 13, 13%3≠0
divisor=4: 4*4=16 > 13 → loop exits

n=13 > 1 → append 13

return [13] ✓

Step 5 — Complexity

Approach	Time	Space	Notes
Brute force (divisor up to n)	O(n)	O(k)	Slow for large primes
Stop at √n (optimal)	O(√n)	O(k)	k = number of prime factors

k is very small — at most log₂(n) factors (e.g. 2^30 has 30 factors but n ≈ 10^9).

Step 6 — Edge Cases

n = 1 → loop doesn't execute, n=1 not > 1 → return [] ✓
n is prime (e.g. 13) → inner loop never appends, remaining n > 1 → appended at end ✓
n is a perfect square (e.g. 4) → divisor=2, appended twice, n=1 → return [2,2] ✓
n is a large prime (e.g. 999983) → outer loop runs only √999983 ≈ 1000 times, not 999983 ✓

The "why stop at √n?" question is the core probe: if n = a × b with a ≤ b, then a ≤ √n. So if we've found no factor up to √n, there can't be one — n must be prime.

Interviewer Follow-up Questions

Follow-up 1

How would you return the result as a formatted string "2 × 2 × 3" instead of a list?

return " × ".join(str(f) for f in factors)
# [2, 2, 3] → "2 × 2 × 3"

O(k) extra pass to stringify. Negligible. Good to mention that separating the concerns (compute vs format) is the right approach.

Follow-up 2

Can you further optimize by only testing odd divisors after 2?

Yes — once we've divided out all 2s, no even number can be a factor. So we only need to test 2, then 3, 5, 7, 9… reducing iterations by ~50%.

def prime_factorization_opt(n: int) -> list[int]:
    factors = []
    while n % 2 == 0:          # handle 2 separately
        factors.append(2)
        n //= 2
    divisor = 3
    while divisor * divisor <= n:
        while n % divisor == 0:
            factors.append(divisor)
            n //= divisor
        divisor += 2           # odd numbers only
    if n > 1: factors.append(n)
    return factors
# Still O(√n) but ~2x faster in practice

Follow-up 3

What is the maximum number of distinct prime factors a number n can have?

The product of the first k primes gives the smallest number with k distinct prime factors.

2×3×5×7×11×13×17×19×23 = 223,092,870: that's 9 distinct primes, and it still fits in a signed 32-bit int (multiplying by the next prime, 29, overflows)
For a 64-bit integer (~9.2×10¹⁸), the maximum number of distinct prime factors is 15
So k is extremely bounded in practice — space complexity O(k) ≈ O(1) for fixed-size integers

This answer shows mathematical depth — BCG will appreciate that you can bound k analytically, not just say "it's small".
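
The bounds above can be checked with a few lines of code. This is a sketch under the assumption that "fits" means "does not exceed a given limit"; the helper name is hypothetical.

```python
def max_distinct_prime_factors(limit: int) -> int:
    """How many of the smallest primes can be multiplied without exceeding limit."""
    count, product, candidate = 0, 1, 2
    while True:
        # Trial-division primality check; candidates stay tiny, so this is cheap
        if all(candidate % d for d in range(2, int(candidate ** 0.5) + 1)):
            if product * candidate > limit:
                return count
            product *= candidate
            count += 1
        candidate += 1


print(max_distinct_prime_factors(2**31 - 1))  # 9
print(max_distinct_prime_factors(2**63 - 1))  # 15
```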

Problem G · Valid Parentheses

Easy

Given a string containing only '(', ')', '{', '}', '[', ']', determine if it is valid. Open brackets must be closed by the same type in the correct order.

"()[]{}" → True | "(]" → False | "([)]" → False | "{[]}" → True

Step 1 — Clarify

Can the string be empty? → yes, empty string is valid (return True)
Only those 6 characters guaranteed? → assume yes
What about whitespace or other chars? → not in this problem

Step 2 — Intuition

Classic stack problem. Push opening brackets. On closing bracket, check it matches the top of the stack.

Step 3 — Solution

def is_valid(s: str) -> bool:
    stack = []
    mapping = {')': '(', '}': '{', ']': '['}

    for char in s:
        if char in mapping:             # closing bracket
            top = stack.pop() if stack else '#'
            if mapping[char] != top:
                return False
        else:
            stack.append(char)          # opening bracket

    return len(stack) == 0
# Time: O(n)  Space: O(n)

Step 4 — Complexity

Metric	Value	Why
Time	O(n)	Single pass through string
Space	O(n)	Worst case: all opening brackets on stack

Step 5 — Edge Cases

Empty string → stack empty → True ✓
Starts with closing bracket → stack empty → top falls back to the '#' sentinel → mismatch → False ✓
Odd-length string → can never be valid → optional early return (len(s) % 2 == 1), not included in the solution above
"(((" → loop ends, stack not empty → return False ✓

The '#' sentinel avoids an IndexError when the stack is empty and we receive a closing bracket — cleaner than an explicit if stack check inside the condition.
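
For comparison, here is a sketch of the variant that adds the odd-length early return and uses an explicit empty-stack check instead of the sentinel. The function name is hypothetical; behavior is intended to match the solution above.

```python
def is_valid_early_exit(s: str) -> bool:
    if len(s) % 2 == 1:            # odd length can never balance
        return False
    pairs = {')': '(', '}': '{', ']': '['}
    stack: list[str] = []
    for char in s:
        if char in pairs:          # closing bracket
            if not stack or stack.pop() != pairs[char]:
                return False
        else:                      # opening bracket
            stack.append(char)
    return not stack
```

Either style is fine in an interview; the explicit check is slightly longer but avoids having to explain the sentinel.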

Problem H · Longest Substring Without Repeating Characters

Medium

Given a string s, find the length of the longest substring without repeating characters.

"abcabcbb" → 3 ("abc") | "bbbbb" → 1 | "pwwkew" → 3 ("wke") | "" → 0

Step 1 — Clarify

Can the string be empty? → yes, return 0
Only ASCII? → assume yes for simplicity
Is the answer the length or the substring itself? → length

Step 2 — Brute Force

Check all O(n²) substrings, verify each in O(n). Total O(n³) — too slow. Not worth coding, just mention it.
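
If the interviewer does ask to see it, or you want a reference implementation to test the optimized version against, the brute force could look like the sketch below (hypothetical function name).

```python
def length_of_longest_substring_brute(s: str) -> int:
    """O(n^3) baseline: try every substring and check it for duplicates."""
    best = 0
    for i in range(len(s)):
        for j in range(i + 1, len(s) + 1):
            window = s[i:j]
            if len(set(window)) == len(window):   # O(n) uniqueness check
                best = max(best, len(window))
    return best
```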

Step 3 — Sliding Window (Optimal)

Maintain a window [left, right] with a set of current characters. Expand right; when a duplicate appears, shrink from left.

def length_of_longest_substring(s: str) -> int:
    char_set = set()
    left = 0
    max_len = 0

    for right in range(len(s)):
        while s[right] in char_set:   # shrink until no duplicate
            char_set.remove(s[left])
            left += 1
        char_set.add(s[right])
        max_len = max(max_len, right - left + 1)

    return max_len
# Time: O(n)  Space: O(k) — k = charset size

Step 4 — Complexity

Approach	Time	Space
Brute force	O(n³)	O(1)
Sliding window + set	O(n)	O(k) charset
Sliding window + dict (optimized)	O(n)	O(k)

Step 5 — Edge Cases

Empty string → loop never runs → returns 0 ✓
All same chars ("aaaa") → window always resets to size 1 ✓
All unique → window expands to full length ✓
Single char → returns 1 ✓

Follow-up

Can you optimize to avoid shrinking the window one step at a time?

Use a dict to store the last seen index of each character. When a duplicate is found, jump left directly to last_seen[char] + 1.

def length_of_longest_substring_opt(s: str) -> int:
    last_seen = {}
    left = 0
    max_len = 0
    for right, char in enumerate(s):
        if char in last_seen and last_seen[char] >= left:
            left = last_seen[char] + 1  # jump directly
        last_seen[char] = right
        max_len = max(max_len, right - left + 1)
    return max_len
# Still O(n) but no inner while loop — single clean pass

Problem I · Number of Islands

Medium

Given an m × n grid of '1's (land) and '0's (water), return the number of islands. An island is surrounded by water and formed by connecting adjacent lands horizontally or vertically.

[["1","1","0"],["0","1","0"],["0","0","1"]] → 2

Step 1 — Clarify

Can the grid be empty? → yes, return 0
Diagonal connections count? → no, only horizontal/vertical
Can we modify the grid in place? → yes (if not, use a visited set)

Step 2 — Approach: DFS flood fill

Iterate the grid. When we find a '1', increment counter and DFS to mark all connected land as visited (set to '0'). This "floods" the island so we don't count it again.

Step 3 — Solution

def num_islands(grid: list[list[str]]) -> int:
    if not grid: return 0
    rows, cols = len(grid), len(grid[0])
    count = 0

    def dfs(r, c):
        if r < 0 or r >= rows or c < 0 or c >= cols:
            return
        if grid[r][c] != '1': return
        grid[r][c] = '0'           # mark visited
        dfs(r+1, c); dfs(r-1, c)
        dfs(r, c+1); dfs(r, c-1)

    for r in range(rows):
        for c in range(cols):
            if grid[r][c] == '1':
                count += 1
                dfs(r, c)
    return count
# Time: O(m×n)  Space: O(m×n) call stack worst case

Step 4 — Complexity

Metric	Value	Why
Time	O(m×n)	Each cell visited at most once
Space	O(m×n)	DFS call stack depth — worst case entire grid is land

Step 5 — Edge Cases

Empty grid → return 0 ✓
All water → count stays 0 ✓
All land → entire grid is one island → return 1 ✓
Single cell '1' → return 1 ✓

Follow-up

How would you implement this with BFS instead of DFS? What's the practical difference?

from collections import deque
def bfs(r, c):
    queue = deque([(r, c)])
    grid[r][c] = '0'
    while queue:
        row, col = queue.popleft()
        for dr, dc in [(1,0),(-1,0),(0,1),(0,-1)]:
            nr, nc = row+dr, col+dc
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == '1':
                grid[nr][nc] = '0'
                queue.append((nr, nc))

BFS uses an explicit queue — avoids Python's recursion limit on very large grids
DFS is simpler to write, BFS is safer for production on large inputs
Same time and space complexity: O(m×n)
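
The BFS helper above relies on `grid`, `rows`, and `cols` from the enclosing function. A self-contained sketch that also covers the "cannot modify the grid" case from Step 1 by using a visited set could look like this (the function name is an assumption):

```python
from collections import deque


def num_islands_bfs(grid: list[list[str]]) -> int:
    """Count islands with BFS, leaving the input grid untouched."""
    if not grid or not grid[0]:
        return 0
    rows, cols = len(grid), len(grid[0])
    visited: set[tuple[int, int]] = set()
    count = 0
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != '1' or (r, c) in visited:
                continue
            count += 1                       # new island found
            visited.add((r, c))
            queue = deque([(r, c)])
            while queue:
                row, col = queue.popleft()
                for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    nr, nc = row + dr, col + dc
                    if (0 <= nr < rows and 0 <= nc < cols
                            and grid[nr][nc] == '1'
                            and (nr, nc) not in visited):
                        visited.add((nr, nc))   # mark when enqueued, not when popped
                        queue.append((nr, nc))
    return count
```

Marking a cell as visited at enqueue time keeps each cell from entering the queue more than once.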

Problem J · LRU Cache

Hard — Lead Engineers

Design a data structure for a Least Recently Used (LRU) cache that supports get(key) and put(key, value) in O(1) time. When capacity is reached, evict the least recently used item.

LRUCache(2) → put(1,1) → put(2,2) → get(1)=1 → put(3,3) [evicts 2] → get(2)=-1

Step 1 — Clarify

What does "recently used" mean? → both get and put count as use
What to return if key not found? → -1
Can capacity be 0? → assume capacity ≥ 1

Step 2 — Key Insight

O(1) get + O(1) put + O(1) eviction requires two data structures combined: a HashMap for O(1) key lookup + a Doubly Linked List to track recency order with O(1) insert/delete.

Most recent → head of list
Least recent → tail of list (eviction target)
HashMap maps key → node pointer for O(1) access anywhere in the list

Step 3 — Solution

class Node:
    def __init__(self, key=0, val=0):
        self.key = key; self.val = val
        self.prev = self.next = None

class LRUCache:
    def __init__(self, capacity: int):
        self.cap = capacity
        self.cache = {}                 # key → node
        self.head = Node()              # dummy head (most recent)
        self.tail = Node()              # dummy tail (least recent)
        self.head.next = self.tail
        self.tail.prev = self.head

    def _remove(self, node):
        node.prev.next = node.next
        node.next.prev = node.prev

    def _insert_front(self, node):  # insert after head
        node.next = self.head.next
        node.prev = self.head
        self.head.next.prev = node
        self.head.next = node

    def get(self, key: int) -> int:
        if key not in self.cache: return -1
        node = self.cache[key]
        self._remove(node)
        self._insert_front(node)    # mark as most recent
        return node.val

    def put(self, key: int, val: int):
        if key in self.cache:
            self._remove(self.cache[key])
        node = Node(key, val)
        self.cache[key] = node
        self._insert_front(node)
        if len(self.cache) > self.cap:
            lru = self.tail.prev      # evict least recent
            self._remove(lru)
            del self.cache[lru.key]
# get: O(1)  put: O(1)  evict: O(1)

Step 4 — Complexity

Operation	Time	Why
get(key)	O(1)	HashMap lookup + linked list move
put(key, val)	O(1)	HashMap insert + linked list insert + O(1) eviction
Space	O(capacity)	HashMap + list both bounded by capacity

Python shortcut: from collections import OrderedDict — OrderedDict maintains insertion order and has move_to_end(key) and popitem(last=False). Acceptable in interviews but BCG X will ask you to implement from scratch to test your data structure depth.
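
For reference, the shortcut could be sketched as follows. The class name `LRUCacheOD` is hypothetical; `move_to_end` and `popitem(last=False)` are the `OrderedDict` methods mentioned above. Here the most recent entry lives at the end of the dict, the opposite orientation from the linked-list version.

```python
from collections import OrderedDict


class LRUCacheOD:
    """LRU cache backed by OrderedDict (illustrative shortcut version)."""

    def __init__(self, capacity: int):
        self.cap = capacity
        self.data = OrderedDict()

    def get(self, key: int) -> int:
        if key not in self.data:
            return -1
        self.data.move_to_end(key)          # mark as most recently used
        return self.data[key]

    def put(self, key: int, val: int) -> None:
        if key in self.data:
            self.data.move_to_end(key)      # refresh recency before updating
        self.data[key] = val
        if len(self.data) > self.cap:
            self.data.popitem(last=False)   # drop the least recently used entry


cache = LRUCacheOD(2)
cache.put(1, 1)
cache.put(2, 2)
print(cache.get(1))   # 1
cache.put(3, 3)       # evicts key 2
print(cache.get(2))   # -1
```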

Step 5 — Edge Cases

get on missing key → -1 ✓
put same key twice → update value, move to front ✓
capacity = 1 → every new put evicts the previous one ✓
put then immediate eviction → correct key removed from both HashMap and list ✓

Problem K · Group Anagrams

Medium

Given a list of strings, group the anagrams together. Each group must be sorted alphabetically. The final list of groups must be sorted in ascending order by the first word of each group.

["eat","tea","tan","ate","nat","bat"] → [["ate","eat","tea"], ["bat"], ["nat","tan"]]

Step 1 — Clarify

Can input be empty? → yes, return []
Lowercase only? → assume yes
Sort groups by first word ascending? → yes

Step 2 — Key Insight

Two words are anagrams iff their sorted version is identical. Use that sorted string as a hashmap key to group all anagrams in one pass.

from collections import defaultdict

def group_anagrams(strs: list[str]) -> list[list[str]]:
    groups = defaultdict(list)
    for word in strs:
        key = "".join(sorted(word))   # "eat" → "aet"
        groups[key].append(word)

    result = [sorted(g) for g in groups.values()]
    result.sort(key=lambda g: g[0])  # sort groups by first word
    return result
# Time: O(n·k log k) to build the keys, plus O(n log n) string comparisons to order the output  Space: O(n·k)

Step 3 — Complexity

Approach	Time	Space
Brute force (pairwise compare)	O(n²·k log k)	O(n)
Sorted key + HashMap	O(n·k log k)	O(n·k)
Frequency tuple key	O(n·k)	O(n·k)

Step 4 — Edge Cases

Empty list → return [] ✓
Single char strings → sorted("a")="a" → works ✓
All words are anagrams → one group returned ✓

Follow-up — O(n·k) with frequency tuple

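from collections import defaultdict

def group_anagrams_ok(strs):
    groups = defaultdict(list)
    for word in strs:
        count = [0] * 26              # one slot per lowercase letter
        for ch in word:
            count[ord(ch) - ord('a')] += 1
        groups[tuple(count)].append(word)
    result = [sorted(g) for g in groups.values()]
    result.sort(key=lambda g: g[0])   # same output ordering as the main solution
    return result
# Key building O(k) instead of O(k log k)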

Problem L · Binary Search

Easy — Fundamentals

Given a sorted array of distinct integers and a target, return its index. If not found, return -1. You must achieve O(log n) time complexity.

nums=[-1,0,3,5,9,12], target=9 → 4 | target=2 → -1

Step 1 — Clarify

Array is sorted and contains distinct integers → confirmed
Return index or value? → index
Can array be empty? → yes, return -1

Step 2 — Solution

def binary_search(nums: list[int], target: int) -> int:
    left, right = 0, len(nums) - 1

    while left <= right:
        mid = left + (right - left) // 2  # avoids overflow vs (l+r)//2
        if nums[mid] == target:
            return mid
        elif nums[mid] < target:
            left = mid + 1
        else:
            right = mid - 1

    return -1
# Time: O(log n)  Space: O(1)

Step 3 — Why O(log n)?

Each iteration halves the search space. After k steps, remaining elements = n / 2^k. When n / 2^k = 1 → k = log₂(n). For n = 1,000,000: only ~20 comparisons.

Step 4 — Edge Cases

Empty array → left=0, right=-1 → while condition false → -1 ✓
Target smaller than all elements → right keeps shrinking → -1 ✓
Target is first or last element → found after about log₂(n) halvings (not on the first iteration); the inclusive left/right bounds keep both ends reachable ✓
Single element array → mid=0, check once ✓

BCG note on mid = left + (right - left) // 2: in Python integers don't overflow, so (left + right) // 2 is safe too — but mentioning overflow prevention signals production awareness valued by BCG X.

Follow-up

How would you find the first occurrence of target in an array with duplicates?

When nums[mid] == target, don't return — save the index and keep searching left (right = mid - 1).

def first_occurrence(nums, target):
    left, right, result = 0, len(nums)-1, -1
    while left <= right:
        mid = left + (right - left) // 2
        if nums[mid] == target:
            result = mid
            right = mid - 1   # keep searching left
        elif nums[mid] < target: left = mid + 1
        else: right = mid - 1
    return result
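
In production Python code the same result is available from the standard library's `bisect` module. A short sketch, with a hypothetical wrapper name:

```python
from bisect import bisect_left


def first_occurrence_bisect(nums: list[int], target: int) -> int:
    """Index of the first occurrence of target in sorted nums, or -1."""
    i = bisect_left(nums, target)      # leftmost position where target could be inserted
    return i if i < len(nums) and nums[i] == target else -1


print(first_occurrence_bisect([1, 2, 2, 2, 3], 2))  # 1
```

In an interview the hand-written loop is still what is expected; mentioning `bisect_left` afterwards shows you know the library equivalent.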

Problem M · K Closest Points to Origin

Medium

Given an array of points [[x1,y1],[x2,y2],...] and an integer k, return the k closest points to the origin (0,0). Distance is Euclidean. Order of output does not matter.

points=[[1,3],[-2,2]], k=1 → [[-2,2]] (√8 < √10)

Step 1 — Clarify

Do we need to sort the output? → no, any order is fine
Do we need the actual distance or just compare? → just compare → we can skip √ (monotone)
k guaranteed ≤ len(points)? → assume yes

Step 2 — Key Insight: skip sqrt

The square root is monotonically increasing, so ordering points by x²+y² gives the same ranking as ordering them by √(x²+y²). There is no need to compute the square root, which saves computation and keeps integer inputs exact.

Step 3 — Approach A: Sort (simple)

def k_closest_sort(points, k):
    points.sort(key=lambda p: p[0]**2 + p[1]**2)
    return points[:k]
# Time: O(n log n)  Space: O(n) for the sort's key list and merge buffer (note: sorts the input in place)

Step 4 — Approach B: Max-Heap of size k (optimal)

import heapq

def k_closest_heap(points: list, k: int) -> list:
    heap = []
    for x, y in points:
        dist = -(x*x + y*y)          # negate → max-heap via min-heap
        heapq.heappush(heap, (dist, x, y))
        if len(heap) > k:
            heapq.heappop(heap)      # remove farthest
    return [[x, y] for _, x, y in heap]
# Time: O(n log k)  Space: O(k)

Step 5 — Complexity Comparison

Approach	Time	Space	Best when
Sort	O(n log n)	O(n)	k close to n
Max-heap size k	O(n log k)	O(k)	k << n
Quickselect	O(n) avg	O(1)	optimal but complex

Step 6 — Edge Cases

k = len(points) → return all points ✓
Points at origin [0,0] → dist=0, always closest ✓
Negative coordinates → x²+y² handles them correctly ✓
k = 1 → return the single closest point ✓

Mention proactively: "I skip the square root because it's monotone — this is a common optimization in geometry problems and shows awareness of computational cost."
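
The Quickselect row in the comparison table has no code above, so here is one possible sketch. It uses a random pivot with a Hoare-style partition and works on a copy of the input (so this version uses O(n) extra space, unlike an in-place variant); the function name and these choices are assumptions, and 1 ≤ k ≤ len(points) is assumed as in Step 1.

```python
import random


def k_closest_quickselect(points: list[list[int]], k: int) -> list[list[int]]:
    """Return the k points closest to the origin, in arbitrary order."""
    pts = list(points)                        # copy so the caller's list is untouched

    def dist(p: list[int]) -> int:
        return p[0] * p[0] + p[1] * p[1]      # squared distance, no sqrt needed

    lo, hi = 0, len(pts) - 1
    while lo < hi:
        pivot = dist(pts[random.randint(lo, hi)])
        i, j = lo, hi
        while i <= j:                         # Hoare-style partition around pivot
            while dist(pts[i]) < pivot:
                i += 1
            while dist(pts[j]) > pivot:
                j -= 1
            if i <= j:
                pts[i], pts[j] = pts[j], pts[i]
                i += 1
                j -= 1
        # Now pts[lo..j] <= pivot <= pts[i..hi]; keep only the side holding index k-1
        if k - 1 <= j:
            hi = j
        elif k - 1 >= i:
            lo = i
        else:
            break                             # index k-1 sits between j and i: done
    return pts[:k]
```

Average time is O(n) because each round discards a constant fraction of the range in expectation; the worst case is O(n²) if the pivots are consistently unlucky.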

Problem N · Octavian Calendar — Lunar Phase

Medium — Logic + Modulo

On planet Octavia, time is tracked using 8 repeating lunar phases: NewMoon, Crescent, Quarter, Gibbous, Full, Waning, Eclipse, Twilight. The year has 12 seasons (same lengths as Earth months). Given a season, a dayCount, and the initialPhase of day 1 of the year, return the lunar phase of that day.

season="January", dayCount=4, initialPhase="Full" → "Twilight"
season="February", dayCount=4, initialPhase="Crescent" → "Gibbous"

Step 1 — Clarify Assumptions

Are seasons always non-leap year lengths? → yes (Jan=31, Feb=28, Mar=31…)
Is dayCount 1-indexed? → yes, day 1 = initialPhase
Phases cycle strictly in order, wrapping around? → yes, after Twilight comes NewMoon
Is initialPhase always a valid phase name? → assume yes

Step 2 — Reasoning

Compute total days elapsed from day 1 of the year up to and including the target date.

Offset = total_days - 1 (day 1 is offset 0, so it maps to initialPhase directly).

Find the index of initialPhase in the phases list.

Result index = (initial_index + offset) % 8.

Step 3 — Dry Run (Example 2)

season = "February", dayCount = 4, initialPhase = "Crescent"

Days in January = 31
Days in February up to day 4 = 4
Total days from start of year = 31 + 4 = 35

offset = 35 - 1 = 34

phases = [NewMoon, Crescent, Quarter, Gibbous, Full, Waning, Eclipse, Twilight]
index of "Crescent" = 1

result_index = (1 + 34) % 8 = 35 % 8 = 3

phases[3] = "Gibbous" ✓

Step 4 — Solution

def solution(season: str, dayCount: int, initialPhase: str) -> str:
    phases = [
        "NewMoon", "Crescent", "Quarter", "Gibbous",
        "Full", "Waning", "Eclipse", "Twilight"
    ]
    days_per_season = {
        "January": 31, "February": 28, "March":    31,
        "April":   30, "May":      31, "June":     30,
        "July":    31, "August":   31, "September":30,
        "October": 31, "November": 30, "December": 31
    }
    season_order = list(days_per_season.keys())

    # Sum all complete seasons before the target season
    total_days = 0
    for s in season_order:
        if s == season:
            break
        total_days += days_per_season[s]

    # Add the day within the target season
    total_days += dayCount

    # offset from day 1 (day 1 = initialPhase, offset 0)
    offset = total_days - 1

    initial_index = phases.index(initialPhase)
    result_index  = (initial_index + offset) % 8

    return phases[result_index]
# Time: O(1) — at most 12 seasons to iterate   Space: O(1)

Step 5 — Complexity

Metric	Value	Why
Time	O(1)	At most 12 iterations + constant-time modulo
Space	O(1)	Fixed-size lookup tables (8 phases, 12 seasons)

Step 6 — Edge Cases

dayCount = 1 and season = "January" → offset = 0 → returns initialPhase directly ✓
Phase wraps past Twilight → modulo 8 resets to NewMoon ✓
December day 31 → total_days = 365, offset = 364 → (initial + 364) % 8 ✓
initialPhase = "Twilight", offset = 1 → (7+1) % 8 = 0 → "NewMoon" ✓

The key BCG insight to verbalize: "I reduce the problem to a single number — total elapsed days from the start of the year — then use modulo 8 to navigate the cyclic phase list. This avoids any nested loops or conditionals."

Interviewer Follow-up Questions

Follow-up 1

How would you handle leap years where February has 29 days?

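Add an optional year parameter (or a boolean is_leap_year flag) and set February's day count conditionally.

def solution(season, dayCount, initialPhase, year=None):
    # leap year: divisible by 4, except century years not divisible by 400
    is_leap = year is not None and (year % 4 == 0 and
              (year % 100 != 0 or year % 400 == 0))
    # ... build phases and days_per_season as before, then:
    days_per_season["February"] = 29 if is_leap else 28
    # rest of function unchanged

Leap year rule: divisible by 4, except century years, unless the year is also divisible by 400 (1900 was not a leap year, 2000 was). Mentioning this unprompted signals production-level thinking.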

Follow-up 2

If you had to call this function millions of times, how would you optimize it?

Precompute a cumulative day offset for each season at startup (O(12) once), so each call becomes a pure O(1) lookup + modulo — no iteration at all.

# Precompute once
season_start_day = {}
cumulative = 0
for s, d in days_per_season.items():
    season_start_day[s] = cumulative
    cumulative += d

# Per call: pure O(1)
def solution_fast(season, dayCount, initialPhase):
    offset = season_start_day[season] + dayCount - 1
    idx = (phases.index(initialPhase) + offset) % 8
    return phases[idx]

This is the "precompute vs recompute" trade-off — a recurring BCG X theme. Always ask: "what's the call pattern?" before choosing your optimization.

Phase 04 · ~7 min

Complexity Deep-Dive

Complexity — Q1

Is O(n log n) bigger or smaller than O(n)? Explain with an example.

O(n log n) is bigger — it grows faster than O(n).

For n = 1,000,000: O(n) = 1,000,000 operations. O(n log n) ≈ 20,000,000 operations (log₂(10⁶) ≈ 20).

So n log n is roughly 20x slower for large inputs in this case
However, O(n log n) is still very efficient — most sorting algorithms (merge sort, heap sort) are O(n log n)
Hierarchy: O(1) < O(log n) < O(n) < O(n log n) < O(n²) < O(2ⁿ)
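
The "roughly 20x" figure can be reproduced directly. A small sketch, with a hypothetical helper name:

```python
import math


def n_log_n(n: int) -> float:
    """Operation count for an n log2 n algorithm (constant factors ignored)."""
    return n * math.log2(n)


for n in (1_000, 1_000_000):
    ratio = n_log_n(n) / n       # equals log2(n)
    print(f"n = {n:>9,}: n log n is about {ratio:.1f}x the linear cost")
```

The ratio is just log₂(n), so it grows very slowly: about 10x at n = 1,000 and about 20x at n = 1,000,000.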

Complexity — Q2

Can you solve the anagram problem in O(n)? If yes, how? If no, why not?

Yes, using a frequency hashmap (dictionary).

def is_anagram(s1: str, s2: str) -> bool:
    if len(s1) != len(s2):
        return False

    count = {}
    for ch in s1:
        count[ch] = count.get(ch, 0) + 1  # build freq map
    for ch in s2:
        count[ch] = count.get(ch, 0) - 1  # decrement
        if count[ch] < 0:
            return False

    return True
# Time: O(n)  Space: O(1) — bounded by 26 letters

Approach	Time	Why
sorted() comparison	O(n log n)	Sorting is O(n log n)
Frequency hashmap	O(n)	Two linear passes, O(1) dict access

Key statement: "Dictionary lookup is O(1) on average — that's what enables us to go from O(n log n) to O(n)."

Complexity — Q3

What is the time complexity of binary search and why? When can you use it?

O(log n) — because each step halves the search space.

After k steps, remaining elements = n / 2^k. When n / 2^k = 1, we've found the answer → k = log₂(n).

Prerequisite: the array must be sorted
For n = 1,000,000: binary search takes ~20 comparisons vs 1,000,000 for linear search
Use cases: sorted arrays, rotated sorted arrays, "find first/last occurrence", search in answer space (binary search on the answer)

def binary_search(nums, target):
    left, right = 0, len(nums) - 1
    while left <= right:
        mid = (left + right) // 2
        if nums[mid] == target: return mid
        elif nums[mid] < target: left = mid + 1
        else: right = mid - 1
    return -1
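
"Binary search on the answer" from the use-case list deserves one concrete instance. The sketch below computes an integer square root by searching the range of possible answers rather than an array; the function name is hypothetical.

```python
def integer_sqrt(n: int) -> int:
    """Largest x with x * x <= n, for n >= 0, found by binary search on x."""
    left, right = 0, n
    while left < right:
        mid = (left + right + 1) // 2   # round up so `left = mid` always makes progress
        if mid * mid <= n:
            left = mid                  # mid is feasible, answer is mid or larger
        else:
            right = mid - 1             # mid is too big
    return left


print(integer_sqrt(2700))  # 51
```

The pattern applies whenever the feasibility check is monotone: every x up to the answer passes, and every x beyond it fails.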

Complexity — Q4

You have a solution that runs in O(n²). The interviewer asks you to optimize it. What patterns should you think of first?

Framework for optimization from O(n²):

HashMap/HashSet → eliminate inner loop by looking up complements in O(1). Two Sum, Anagram, duplicates detection.

Sliding Window → when the problem involves a contiguous subarray/substring. Moves a window rather than re-scanning.

Two Pointers → when working on a sorted array. Start from both ends and converge.

Sort first → O(n log n) pre-processing sometimes unlocks an O(n) second pass. Better than O(n²) overall.

Prefix sum / Memoization → avoid recomputing sub-results. Dynamic programming territory.

BCG wants to hear you think in patterns — naming the pattern before coding signals strong algorithmic thinking.
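
As an illustration of the first pattern, here is Two Sum written both ways. The function names and the "return a pair of indices or None" contract are assumptions made for this sketch.

```python
def two_sum_quadratic(nums: list[int], target: int):
    """O(n^2): check every pair."""
    for i in range(len(nums)):
        for j in range(i + 1, len(nums)):
            if nums[i] + nums[j] == target:
                return (i, j)
    return None


def two_sum_hashmap(nums: list[int], target: int):
    """O(n): the inner loop becomes a dictionary lookup for the complement."""
    seen = {}                           # value -> index where it was seen
    for j, value in enumerate(nums):
        i = seen.get(target - value)
        if i is not None:
            return (i, j)
        seen[value] = j
    return None


print(two_sum_hashmap([2, 7, 11, 15], 9))  # (0, 1)
```

The trade is O(n) extra space for the dictionary in exchange for removing the inner loop.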

Complexity — Q5

What is the space complexity of a recursive DFS on a graph with n nodes and e edges?

Space complexity = O(n) for the call stack in the worst case.

In the worst case (a path graph), DFS goes n levels deep before backtracking → call stack depth = n
We also need O(n) for the visited set to avoid cycles
Total space: O(n)
Time complexity for DFS/BFS on a graph = O(n + e) — we visit each node once and each edge once

For very deep recursion on large graphs, Python's default recursion limit (1000 frames unless raised with sys.setrecursionlimit) is reached and a RecursionError is raised → prefer iterative DFS with an explicit stack in production.
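
An iterative DFS could be sketched as follows, assuming the graph is given as an adjacency-list dict (that representation and the function name are assumptions):

```python
def dfs_iterative(graph: dict[int, list[int]], start: int) -> list[int]:
    """Depth-first traversal order from start, using an explicit stack."""
    visited: set[int] = set()
    order: list[int] = []
    stack = [start]
    while stack:
        node = stack.pop()
        if node in visited:
            continue
        visited.add(node)
        order.append(node)
        # Push neighbors reversed so they are explored in their listed order
        stack.extend(reversed(graph.get(node, [])))
    return order


# A 5,000-node path graph is far deeper than the default recursion limit
path = {i: [i + 1] for i in range(5000)}
print(len(dfs_iterative(path, 0)))  # 5001
```

The space bound is still O(n + e) in the worst case for this version, since a node can be pushed once per incoming edge, but the memory lives on the heap instead of the call stack.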

Cheat Sheet — Patterns to Master

Pattern	Typical Complexity	Key Problems
HashMap	O(n)	Two Sum, Anagram, Group Anagrams
Sliding Window	O(n)	Longest Substring, Max Subarray
Stack	O(n)	Valid Parentheses, Next Greater Element
BFS/DFS	O(n+e)	Number of Islands, Shortest Path
Binary Search	O(log n)	Sorted array search, rotated array
Heap / Priority Queue	O(n log k)	Top K Elements, K Closest Points
Sorting	O(n log n)	Merge Intervals, Meeting Rooms
Two Pointers	O(n)	Container With Most Water, 3Sum

Phase 05 · ~15 min

Scientific Paper — Read, Understand, Implement

The interviewer hands the candidate this paper extract. They have 5 minutes to read it silently, then questions follow. No Googling. No notes. Just comprehension, reasoning, and implementation.

📄 Paper Extract — Read for 5 minutes

TF-IDF & BM25 — Excerpt

TF-IDF (Term Frequency–Inverse Document Frequency) is a numerical statistic that reflects how important a word is to a document in a corpus.

TF(t, d) = count(t in d) / total_terms(d)
IDF(t, D) = log( N / (1 + df(t)) )
TF-IDF(t, d, D) = TF(t, d) × IDF(t, D)

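Where N is the total number of documents, and df(t) is the number of documents containing term t. The +1 in the denominator prevents division by zero for terms that appear in no document (df(t) = 0), such as unseen query terms; as a side effect, a term that appears in every document gets a slightly negative IDF, log(N / (N + 1)).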

BM25 (Best Match 25) improves on TF-IDF with two key mechanisms: term frequency saturation and document length normalization.

score(d, q) = Σ IDF(qᵢ) × [ tf(qᵢ,d) × (k1+1) ] / [ tf(qᵢ,d) + k1 × (1 - b + b × |d|/avgdl) ]

IDF(qᵢ) = log( (N - df(qᵢ) + 0.5) / (df(qᵢ) + 0.5) + 1 )

k1 ∈ [1.2, 2.0] — controls TF saturation
b ∈ [0, 1] — controls length normalization (typically 0.75)
avgdl — average document length in the corpus

As tf grows, the TF fraction saturates: it approaches k1 + 1, so doubling term frequency no longer doubles the score. When b = 0, length normalization is disabled. When b = 1, full normalization is applied.

Paper Q1 — Comprehension

In TF-IDF, why do we apply a logarithm to the IDF component? What happens without it?

The log dampens the effect of rare terms. Without it, a term appearing in only 1 document out of 1,000,000 would get an IDF of 1,000,000 — completely dominating the score for any document that contains it, even once.

With log: IDF = log(1,000,000) ≈ 14 — significant but not overwhelming
Without log: IDF = 1,000,000 — absurdly large, kills all balance
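With the unsmoothed ratio N/df, the log also makes IDF = 0 for terms appearing in every document (log(N/N) = log(1) = 0), which is the correct behavior for stopwords like "the", "is"; with the excerpt's +1 smoothing the value is log(N/(N+1)), just below 0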

Key insight to say out loud: "The log compresses the range so that rare terms are weighted meaningfully but not explosively."
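
The numbers in this answer use the unsmoothed ratio N/df (no +1 in the denominator). Under that assumption they can be reproduced with two hypothetical helpers:

```python
import math


def idf_raw(n_docs: int, df: int) -> float:
    """Inverse document frequency without the log (unsmoothed ratio)."""
    return n_docs / df


def idf_log(n_docs: int, df: int) -> float:
    """Inverse document frequency with the natural log applied."""
    return math.log(n_docs / df)


print(idf_raw(1_000_000, 1))             # 1000000.0
print(round(idf_log(1_000_000, 1), 1))   # 13.8
print(idf_log(1_000, 1_000))             # 0.0
```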

Paper Q2 — Comparison

What specific problem does BM25 solve that TF-IDF does not? Explain term frequency saturation and length normalization in your own words.

Problem 1 — TF saturation: in TF-IDF, if a term appears 100 times instead of 10, the score is 10× higher. But intuitively, a document mentioning "python" 100 times isn't 10× more relevant than one mentioning it 10 times. BM25 applies a saturation curve — score grows fast at first then plateaus as tf increases. Controlled by k1.

Problem 2 — Length normalization: a long document will naturally contain more term occurrences just because it's longer. TF-IDF doesn't account for this. BM25 penalizes long documents via the |d|/avgdl ratio, controlled by b. A term in a 50-word document is more meaningful than the same term in a 5,000-word document.

k1 = 0 → BM25 becomes binary (term present/absent only)
b = 0 → no length normalization (same as raw TF behavior)
b = 1 → full normalization (divides TF by document length)
Default k1=1.5, b=0.75 works well empirically across most IR tasks
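
The behaviors listed above can be checked numerically by isolating the TF part of the BM25 formula. This is a sketch; the function name is hypothetical, and the tf = 0 guard is an added assumption to avoid dividing by zero when k1 = 0.

```python
def bm25_tf_component(tf: float, dl: float, avgdl: float,
                      k1: float = 1.5, b: float = 0.75) -> float:
    """The fraction that multiplies IDF in BM25, for a single term."""
    if tf == 0:
        return 0.0                                # absent term contributes nothing
    norm = 1 - b + b * dl / avgdl                 # length normalization factor
    return tf * (k1 + 1) / (tf + k1 * norm)


# Saturation: for an average-length document the value climbs toward k1 + 1 = 2.5
for tf in (1, 10, 100):
    print(tf, round(bm25_tf_component(tf, dl=100, avgdl=100), 2))

# k1 = 0: any positive tf gives exactly 1 (binary presence)
print(bm25_tf_component(5, dl=100, avgdl=100, k1=0))  # 1.0
```

With the defaults, going from tf = 10 to tf = 100 moves the component only from about 2.17 to about 2.46, whereas plain TF would grow tenfold.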

Paper Q3 — Implementation

Implement TF-IDF scoring from scratch in Python. Given a corpus of documents and a query, return the top-3 most relevant documents ranked by their TF-IDF score. Then propose one concrete optimization.

Step 1 — TF-IDF Implementation

import math
from collections import Counter

def compute_tf(doc: list[str]) -> dict:
    """Term frequency: count(t) / total_terms"""
    counts = Counter(doc)
    total = len(doc)
    return {term: cnt / total for term, cnt in counts.items()}

def compute_idf(corpus: list[list[str]]) -> dict:
    """IDF: log(N / (1 + df(t))) for each term"""
    N = len(corpus)
    df = {}
    for doc in corpus:
        for term in set(doc):  # set → count doc once per term
            df[term] = df.get(term, 0) + 1
    return {term: math.log(N / (1 + df[term])) for term in df}

def tfidf_score(query: list[str], doc: list[str],
               idf: dict) -> float:
    """Score a single doc against a query"""
    tf = compute_tf(doc)
    return sum(tf.get(t, 0) * idf.get(t, 0) for t in query)

def top_k_tfidf(query_str: str, corpus_strs: list[str],
               k: int = 3) -> list[tuple]:
    corpus = [doc.lower().split() for doc in corpus_strs]
    query  = query_str.lower().split()
    idf    = compute_idf(corpus)

    scores = [(i, tfidf_score(query, doc, idf))
              for i, doc in enumerate(corpus)]
    scores.sort(key=lambda x: -x[1])  # descending score
    return scores[:k]

# --- Demo ---
corpus = [
    "python is great for data science",
    "data science uses python and R",
    "java is used for backend development",
    "python python python everywhere"
]
print(top_k_tfidf("python data", corpus))
# → [(0, 0.048), (1, 0.048), (2, 0.0)]  (rounded; "python" appears in 3 of 4 docs, so its IDF is log(4/4) = 0 and only "data" contributes)

Step 2 — Complexity

Step	Time	Space
Build IDF (offline)	O(N × L)	O(V)
Score one doc	O(|doc| + |query|)	O(|doc|)
Score all docs + sort	O(N × L + N log N)	O(N)

N = corpus size, L = avg doc length, V = vocabulary size.

Step 3 — Proposed Optimizations

Inverted index: instead of scanning all documents per query term, precompute a mapping term → [doc_ids]. Query only hits documents that contain at least one query term — reduces N dramatically for large corpora.
Pre-normalize TF vectors: compute and cache TF at index time, not at query time. Score becomes a dot product lookup.
Sparse representation: store TF-IDF vectors as dicts (only non-zero terms) instead of dense arrays — memory O(V) → O(L) per doc.
Switch to BM25: replace the TF component with the saturation formula; BM25 is generally reported to rank better than plain TF-IDF on standard IR benchmarks, though the size of the gain depends on the corpus and the metric.

The interviewer will probe: "Why did you use set(doc) in compute_idf?" → Answer: to count each document only once per term, regardless of how many times the term appears in that document. df(t) = number of documents containing t, not total occurrences.
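
The inverted-index optimization from Step 3 could be sketched as below. Both function names are hypothetical, and the corpus is assumed to be already tokenized into lists of lowercase terms.

```python
from collections import defaultdict


def build_inverted_index(corpus: list[list[str]]) -> dict[str, set[int]]:
    """Map each term to the set of document ids that contain it (built once)."""
    index: dict[str, set[int]] = defaultdict(set)
    for doc_id, doc in enumerate(corpus):
        for term in doc:
            index[term].add(doc_id)
    return index


def candidate_docs(query: list[str], index: dict[str, set[int]]) -> set[int]:
    """Documents containing at least one query term; only these need scoring."""
    candidates: set[int] = set()
    for term in query:
        candidates |= index.get(term, set())
    return candidates


docs = [d.lower().split() for d in [
    "python is great for data science",
    "data science uses python and R",
    "java is used for backend development",
    "python python python everywhere",
]]
index = build_inverted_index(docs)
print(sorted(candidate_docs(["python", "data"], index)))  # [0, 1, 3]
```

Document 2 is never scored for this query because it shares no term with it; on a large corpus most documents fall into that category.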

Bonus — If time allows

Extend your implementation to BM25. Where exactly does the code change, and what new parameters do you need?

The scoring function is where the code changes; corpus tokenization and document-frequency counting stay the same, and the IDF formula is swapped for the BM25 variant.

def bm25_score(query: list[str], doc: list[str],
              idf: dict, avgdl: float,
              k1: float = 1.5, b: float = 0.75) -> float:
    counts = Counter(doc)
    dl = len(doc)
    score = 0.0
    for term in query:
        tf = counts.get(term, 0)
        idf_val = idf.get(term, 0)
        # BM25 saturation + length norm
        numerator   = tf * (k1 + 1)
        denominator = tf + k1 * (1 - b + b * dl / avgdl)
        score += idf_val * (numerator / denominator)
    return score

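# avgdl computed once at index time, over the tokenized corpus
# (lists of terms, as built inside top_k_tfidf), not the raw strings:
tokenized = [doc.lower().split() for doc in corpus]
avgdl = sum(len(doc) for doc in tokenized) / len(tokenized)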

2 new hyperparameters: k1 (saturation), b (length norm)
1 new corpus stat: avgdl (computed once, O(N×L))
The IDF formula also changes slightly — uses (N - df + 0.5) / (df + 0.5) for better probabilistic calibration

Key phrase to say: "The only structural change is in the TF numerator/denominator — everything else (inverted index, IDF caching, top-k selection) remains identical."
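
The BM25 IDF variant from the excerpt is not implemented above, so here is a sketch that mirrors `compute_idf` with the new formula. The function name `compute_bm25_idf` is an assumption.

```python
import math


def compute_bm25_idf(corpus: list[list[str]]) -> dict[str, float]:
    """IDF(t) = log((N - df + 0.5) / (df + 0.5) + 1) for each term in the corpus."""
    n_docs = len(corpus)
    df: dict[str, int] = {}
    for doc in corpus:
        for term in set(doc):              # count each document once per term
            df[term] = df.get(term, 0) + 1
    return {term: math.log((n_docs - d + 0.5) / (d + 0.5) + 1)
            for term, d in df.items()}


docs = [d.lower().split() for d in [
    "python is great for data science",
    "data science uses python and R",
    "java is used for backend development",
    "python python python everywhere",
]]
bm25_idf = compute_bm25_idf(docs)
print(round(bm25_idf["data"], 3))    # 0.693
print(round(bm25_idf["python"], 3))  # 0.357
```

Because of the +1 inside the log, every value is strictly positive, so a frequent term like "python" still contributes a little instead of dropping to zero as it does with the TF-IDF formula.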