Question 1

What is Apache Kafka and what is its role in event-driven systems?

Accepted Answer

Apache Kafka is an open-source distributed event streaming platform. It is designed to ingest, process, store, and analyze real-time data streams at scale. In event-driven systems, Kafka acts as a high-throughput, fault-tolerant message broker and storage queue, decoupling publishers (producers) from consumers.

Question 2

Explain Topics, Partitions, and Offsets in Kafka.

Accepted Answer

- Topic: A named stream of messages to which producers publish data.
- Partition: Topics are split into multiple partitions distributed across brokers to enable scale and concurrency.
- Offset: A unique sequential integer assigned to each message within a partition, tracking message ordering.

Question 3

What are Producers and Consumers in Kafka?

Accepted Answer

- Producers: Applications that write and publish event messages to Kafka topics.
- Consumers: Applications that subscribe to topics, read messages, and process data streams.

Question 4

What is a Consumer Group in Kafka?

Accepted Answer

A Consumer Group is a collection of consumers that cooperate to consume messages from a topic. Kafka distributes partitions across the group: each partition is read by only one consumer in the group, enabling concurrent processing without duplicate consumption.

Question 5

Explain how Kafka stores messages and its commit log design.

Accepted Answer

Kafka stores messages as an append-only commit log on disk. Messages are written sequentially to the end of the log file, which is fast (avoiding random disk I/O). Messages are persistent and retained based on configured durations or size limits, letting consumers replay logs.

Question 6

What is a Broker in Apache Kafka?

Accepted Answer

A Broker is an individual Kafka server instance inside a cluster. Brokers store partitions, handle read/write requests from clients, and replicate data to other brokers to ensure fault tolerance.

Question 7

Explain how replication works in Kafka clusters.

Accepted Answer

Every partition has one Leader broker and multiple Follower brokers. The Leader handles all read and write requests. Followers replicate data from the leader. If the leader crashes, one of the followers is elected as the new leader.

Question 8

What is the role of Zookeeper (or KRaft) in Kafka clusters?

Accepted Answer

Zookeeper (or KRaft in modern Kafka) manages cluster state, tracks active brokers, coordinates leader elections for partitions, and stores configuration metadata, ensuring cluster consistency.

Question 9

Explain the difference between Kafka and traditional message brokers like RabbitMQ.

Accepted Answer

- RabbitMQ: Smart broker, dumb consumer. Deletes messages once processed, uses complex routing keys, and is ideal for basic task queues.
- Kafka: Dumb broker, smart consumer. Persists messages on disk, allows log replays, and is optimized for high-throughput event streaming.

Question 10

What is message retention in Kafka?

Accepted Answer

Message retention defines how long Kafka stores messages before deleting them (default is 7 days). You configure retention by time (`log.retention.hours`) or log size (`log.retention.bytes`), allowing consumers to read data historically.

Question 11

Explain the concept of partition reassignment.

Accepted Answer

Partition reassignment is the process of reallocating topic partitions across brokers. It is used to balance cluster loads, add new brokers to clusters, or decommission old servers.

Question 12

What is a record key in Kafka and how is it used for routing?

Accepted Answer

A record key is metadata attached to a message. If a key is provided, Kafka hashes the key to determine which partition to send the message to: `partition = hash(key) % partitionCount`, ensuring messages with the same key go to the same partition.

Question 13

Explain how to consume messages from a specific offset.

Accepted Answer

Consumers can manually seek to a specific offset using the `seek()` API. This allows replaying logs from a past offset or skipping messages, bypassing automatic offset commits.

Question 14

What is a compaction topic in Kafka?

Accepted Answer

Log compaction is a retention policy where Kafka retains only the latest message value for each key within a partition, discarding older updates, which is useful for state restorations.

Question 15

Explain the role of the bootstrap servers parameter.

Accepted Answer

The `bootstrap.servers` parameter is a list of broker addresses used by clients to establish initial connections. The client connects to one broker to retrieve the full cluster metadata (broker list, partition mappings).

Question 16

What is the difference between active and passive replication in Kafka?

Accepted Answer

- Active: The leader handles writes and replicates data to followers.
- Passive: Followers pull data from the leader asynchronously. Followers must stay in the In-Sync Replicas (ISR) list to be eligible for leader elections.

Question 17

Explain the purpose of the schema registry in Kafka.

Accepted Answer

The Schema Registry is a separate service that stores validation schemas (like Avro or JSON Schema) for Kafka message payloads, ensuring producers and consumers share compatible data formats.

Question 18

Explain Kafka consumer group rebalancing and how to prevent rebalance storms.

Accepted Answer

Rebalancing occurs when a consumer joins or leaves a group, or partitions are added, forcing Kafka to reassign partitions. A rebalance storm happens if consumers are slow to process messages, trigger timeouts, and leave the group, causing infinite rebalance loops. Prevent them by increasing `max.poll.interval.ms` or tuning `heartbeat.interval.ms`.

Question 19

Explain Kafka producer configurations for message delivery: acks=0, acks=1, and acks=all.

Accepted Answer

The `acks` parameter controls write confirmations:
- `acks=0`: Producer does not wait for confirmations, maximizing throughput but risking data loss.
- `acks=1`: Producer waits for the Leader broker to write to disk, protecting against connection drops.
- `acks=all` (or `-1`): Producer waits for the Leader and all In-Sync Replicas (ISR) to confirm writes, preventing data loss.

Question 20

How does Kafka guarantee message ordering within a topic partition?

Accepted Answer

Kafka guarantees strict message ordering *only* within a single partition. To preserve ordering: publish related messages with the same record key (routing them to the same partition) and configure `max.in.flight.requests.per.connection=1` on producers to prevent out-of-order retries.

Question 21

How do you write integration tests for Kafka producers and consumers using Testcontainers?

Accepted Answer

Use Testcontainers. In test setups, instantiate a Kafka container: `static KafkaContainer kafka = new KafkaContainer(DockerImageName.parse("confluentinc/cp-kafka:latest"))`. Start the container, configure client addresses, produce and consume messages, and assert payloads.

Question 22

Explain In-Sync Replicas (ISR) and partition leader elections in Kafka.

Accepted Answer

The ISR list contains replica brokers that are caught up with the partition leader. If the leader crashes, Kafka elects a follower *only* if it is in the ISR list. If no followers are in the ISR and `unclean.leader.election.enable` is true, Kafka elects an out-of-sync node, risking data loss.

Question 23

Explain how Kafka achieves high throughput using Zero-Copy and Page Cache techniques.

Accepted Answer

Kafka achieves high throughput by: 
1. Page Cache: Leveraging OS page caches in RAM instead of buffering in JVM heap.
2. Zero-Copy: Bypassing user-space memory copies. When a consumer reads, Kafka uses the `sendfile` system call to transfer log bytes from the page cache directly to the network socket.

Question 24

How do you monitor and resolve consumer lag in production?

Accepted Answer

Consumer lag is the offset difference between the latest produced message and the consumer's read offset. Monitor it using metrics collectors (like Burrow). Resolve by scaling consumer group sizes (up to partition counts) or tuning consumer configurations.

Question 25

How do you mock Kafka producers and consumers in unit tests?

Accepted Answer

Use MockProducer and MockConsumer classes from the `org.apache.kafka.clients.producer/consumer` packages. These mock classes simulate broker connections, letting you test message serialization and polling logic in unit tests.

Question 26

Explain how log cleaner processes execute log compaction.

Accepted Answer

Log cleaner threads run in the background. They scan compaction topics, group messages by keys, and discard older offsets. The latest record value is retained, along with a marker (tombstone) if deleted, saving space.

Question 27

What is partition skew and how does it degrade throughput?

Accepted Answer

Partition skew occurs when messages are distributed unevenly across partitions. This causes specific broker nodes to experience high CPU and disk load while others remain idle, degrading cluster performance.

Question 28

Explain Kafka transaction processing and transactional IDs.

Accepted Answer

To process messages across topics atomically (read-process-write), configure producers with `transactional.id` and run commands inside `beginTransaction()`/`commitTransaction()` blocks, allowing consumers to read committed data only.

Question 29

What is the difference between offset commit strategies: auto commit vs manual commit?

Accepted Answer

- Auto Commit (`enable.auto.commit=true`): Automatically commits offsets at intervals, which is simple but risks duplicate processing on crashes.
- Manual Commit: Consumer calls `commitSync()` or `commitAsync()` after processing messages, ensuring exact execution.

Question 30

How do you test Kafka schema validations in CI/CD pipelines?

Accepted Answer

Integrate with the Confluent Schema Registry. Write tests that register Avro/JSON schemas, validate that producers reject mismatched payloads, and verify that schemas are backwards compatible before updates.

Question 31

Explain Kafka Streams API and stateless vs stateful operations.

Accepted Answer

Kafka Streams is a client library for building stream processing applications:
- Stateless: Simple mappings or filters on individual messages.
- Stateful: Windowed joins and aggregations on keys, which store states locally in RocksDB databases.

Question 32

What is segment size in Kafka logs and how does it affect compaction?

Accepted Answer

Kafka splits partition logs into segment files on disk (default 1GB). Log compaction and deletions only occur on closed segments; active segments are never cleaned, which is important for memory sizing.

Question 33

How do you manage Kafka client connections leaks?

Accepted Answer

Ensure Kafka clients (producers and consumers) are reused as singletons and closed properly in shutdown hooks. Connection leaks exhaust broker threads, causing timeouts in clusters.

Question 34

Explain Kafka Exactly-Once Semantics (EOS), detailing how idempotent producers, transactional coordinators, and 2PC transactions work.

Accepted Answer

Kafka Exactly-Once Semantics (EOS) guarantees that messages are processed exactly once across read-process-write cycles. Key components:
1. Idempotent Producers: Producers attach unique sequence numbers and producer IDs (PIDs) to messages. If a broker receives duplicate sequence numbers due to network retries, it discards them, avoiding duplicate writes.
2. Transactional Coordinator: A broker node that manages transaction logs.
3. Two-Phase Commit (2PC): When a transaction runs, the coordinator writes the status (prepare/commit) to a `__transaction_state` topic. Once all writes to partition logs confirm, the coordinator writes a commit marker, letting consumers configured with `isolation.level=read_committed` read the data.

Question 35

How would you optimize a Kafka cluster experiencing high controller election times and disk I/O bottlenecks under heavy traffic?

Accepted Answer

Optimize Kafka clusters by:
1. Controller Optimization: Reduce partition counts. Having too many partitions (e.g. > 10k per broker) slows down controller metadata updates and increases election times on broker crashes.
2. Disk I/O: Bind log directories to separate physical SSDs. Tune kernel settings: increase page cache allocations, set `vm.dirty_background_ratio = 5` to flush page caches to disk early, and increase `num.io.threads`.

Question 36

Explain how to secure a Kafka cluster using SASL/SCRAM, SSL/TLS encryption, and ACLs.

Accepted Answer

Secure Kafka by:
1. Encryption in Transit: Enable SSL/TLS encryption for all client-broker and inter-broker communication.
2. Authentication: Configure SASL/SCRAM or SASL/OAUTHBEARER authentication to verify client identities.
3. Authorization: Use Access Control Lists (ACLs) to restrict user access to specific topics (e.g. allowing read/write only on matching paths).

Top 36 Kafka Interview Questions and Answers (2026)

What is Kafka and Why is it Critical in Modern Engineering?

Kafka Lifecycle Visualizer

Core Architectural Concepts in Kafka

Log-Structured Appending

Consumer Group Balancing

Partition Replications

Offset Commit Modes

Zero-Copy Data Pipelines

check_circleWhy Modern Companies Choose Kafka

lightbulbStrategic Preparation Tips

errorCrucial Mistakes to Avoid

trending_upHiring Trends & Career Outlook (2026)

Basics