Entry MS SQL Interview Questions

39 curated MS SQL interview questions for developers preparing for entry-level positions.

MS SQL Interview Questions & Answers

Welcome to our comprehensive collection of MS SQL interview questions and answers. This page contains expertly curated interview questions covering all aspects of MS SQL, from fundamental concepts to advanced topics. Whether you're preparing for an entry-level position or a senior role, you'll find questions tailored to your experience level.

Our MS SQL interview questions are designed to help you:

  • Understand core concepts and best practices in MS SQL
  • Prepare for technical interviews at all experience levels
  • Master both theoretical knowledge and practical application
  • Build confidence for your next MS SQL interview

Each question includes detailed answers and explanations to help you understand not just what the answer is, but why it's correct. We cover topics ranging from basic MS SQL concepts to advanced scenarios that you might encounter in senior-level interviews.

Use the filters below to find questions by difficulty level (Entry, Junior, Mid, Senior, Expert) or focus specifically on code challenges. Each question is carefully crafted to reflect real-world interview scenarios you'll encounter at top tech companies, startups, and MNCs.

Questions

39 questions
Q1:

What is MS SQL Server and where is it commonly used?

Entry

Answer

MS SQL Server is Microsoft’s enterprise relational database system used in finance, e-commerce, SaaS, ERPs, CRMs, and large web applications. It provides secure, consistent, and high-performance data storage.

Quick Summary: MS SQL Server is Microsoft's relational database system built for enterprise use. It stores, retrieves, and manages structured data reliably. You'll find it in finance, healthcare, SaaS apps, ERPs, and large websites where data consistency and performance really matter.
Q2:

Explain Database, Table, Row, and Column in relational terms.

Entry

Answer

A database stores all data objects. A table contains structured data. A row represents one record. A column represents a specific data field with a defined type.

Quick Summary: A database holds all your data objects. A table organizes data into rows and columns, like a spreadsheet. A row is one record (one person, one order). A column is a specific attribute of that record (name, price, date). Together, they form the foundation of relational data storage.
Q3:

What is a Primary Key and why is it important?

Entry

Answer

A primary key uniquely identifies each row. It ensures uniqueness, improves indexing, and maintains reliable data integrity.

Quick Summary: A primary key uniquely identifies each row in a table — no two rows can have the same value, and it can't be NULL. It's the anchor for relationships between tables and speeds up lookups. Without it, finding or linking a specific record becomes messy and unreliable.
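A minimal sketch of declaring a primary key (the `dbo.Customers` table and its columns are hypothetical):

```sql
-- CustomerID uniquely identifies each row; it cannot be NULL or duplicated.
CREATE TABLE dbo.Customers (
    CustomerID INT IDENTITY(1,1) NOT NULL,
    Email      NVARCHAR(255) NOT NULL,
    CONSTRAINT PK_Customers PRIMARY KEY (CustomerID)
);
```

In SQL Server, a primary key also creates a unique index (clustered by default), which is what speeds up lookups on that column.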
Q4:

What is a Foreign Key and what problem does it solve?

Entry

Answer

A foreign key creates a relationship between tables and prevents orphaned or invalid data by enforcing referential integrity.

Quick Summary: A foreign key in one table points to the primary key in another table. It enforces referential integrity — you can't add an order for a customer that doesn't exist. It's how relational databases maintain relationships between entities without duplicating data.
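A sketch of a foreign key referencing a parent table (assumes a hypothetical `dbo.Customers` table with a `CustomerID` primary key):

```sql
CREATE TABLE dbo.Orders (
    OrderID    INT IDENTITY(1,1) PRIMARY KEY,
    CustomerID INT NOT NULL,
    CONSTRAINT FK_Orders_Customers
        FOREIGN KEY (CustomerID) REFERENCES dbo.Customers (CustomerID)
);

-- This INSERT fails if no customer with CustomerID = 999 exists:
-- INSERT INTO dbo.Orders (CustomerID) VALUES (999);
```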
Q5:

What are Constraints in MS SQL?

Entry

Answer

Constraints like primary key, foreign key, unique, check, not null, and default rules ensure accurate, valid, and reliable data.

Quick Summary: Constraints are rules you enforce at the database level. Common ones: NOT NULL (column can't be empty), UNIQUE (no duplicates), CHECK (value must meet a condition), DEFAULT (fallback value), PRIMARY KEY, and FOREIGN KEY. They catch bad data before it ever lands in the table.
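A sketch showing the common constraints together in one table definition (table and column names are hypothetical):

```sql
CREATE TABLE dbo.Products (
    ProductID INT PRIMARY KEY,                        -- PRIMARY KEY
    Name      NVARCHAR(100) NOT NULL,                 -- NOT NULL
    Sku       VARCHAR(20)   NOT NULL UNIQUE,          -- UNIQUE
    Price     DECIMAL(10,2) NOT NULL
        CONSTRAINT CK_Products_Price CHECK (Price >= 0),              -- CHECK
    CreatedAt DATETIME2 NOT NULL
        CONSTRAINT DF_Products_CreatedAt DEFAULT SYSUTCDATETIME()     -- DEFAULT
);
```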
Q6:

What is Normalization and why do we use it?

Entry

Answer

Normalization organizes data to eliminate redundancy, improve consistency, and reduce anomalies during insert, update, or delete operations.

Quick Summary: Normalization organizes data to reduce redundancy and keep things consistent. Instead of storing a customer's name in every order row, you store it once and reference it by ID. This prevents update anomalies — you change data in one place, not a hundred.
Q7:

Explain 1NF, 2NF, and 3NF briefly.

Entry

Answer

1NF: No repeating groups; atomic values.
2NF: Remove partial dependencies.
3NF: Remove transitive dependencies.

Quick Summary: 1NF: each cell holds one atomic value, no repeating groups. 2NF: no partial dependency — every non-key column depends on the whole primary key. 3NF: no transitive dependency — non-key columns depend only on the primary key, not on each other. Each level builds on the previous.
Q8:

What is Denormalization and when is it used?

Entry

Answer

Denormalization introduces controlled redundancy to improve read performance, especially in reporting and analytics systems.

Quick Summary: Denormalization intentionally adds redundancy to speed up reads. Instead of joining three tables every time, you store the data together. Used in reporting databases, dashboards, and read-heavy systems where join performance is a bottleneck and write consistency is less critical.
Q9:

What is an Index and why is it important?

Entry

Answer

An index speeds up data searches by avoiding full table scans. Too many indexes slow down inserts/updates due to maintenance.

Quick Summary: An index is a separate data structure that helps SQL Server find rows fast without scanning the whole table. Think of it like a book's index — instead of reading every page, you jump straight to what you need. Without indexes, every query does a full table scan.
Q10:

What is the difference between Clustered and Non-Clustered Index?

Entry

Answer

Clustered index: Defines physical row order.
Non-clustered index: A separate structure pointing to actual rows.

Quick Summary: Clustered index defines the physical order of rows in the table — only one per table. Non-clustered index is a separate structure with pointers back to the actual rows — you can have many. Clustered is faster for range queries; non-clustered is better for selective lookups on specific columns.
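As a sketch, assuming a hypothetical `dbo.Orders` table:

```sql
-- Clustered index: defines the physical row order; only one per table.
-- (A PRIMARY KEY usually creates this implicitly.)
CREATE CLUSTERED INDEX CIX_Orders_OrderID ON dbo.Orders (OrderID);

-- Non-clustered index: a separate structure pointing back to the rows.
CREATE NONCLUSTERED INDEX IX_Orders_CustomerID ON dbo.Orders (CustomerID);
```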
Q11:

What is a View and why is it used?

Entry

Answer

A view is a virtual table created from a query. Used for simplicity, security, reusability, and hiding complex queries.

Quick Summary: A view is a saved SELECT query that looks like a table. It simplifies complex joins, hides sensitive columns, and gives different users a tailored perspective of the data. The view itself stores no data — it's just a reusable query definition that runs when you access it.
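A minimal sketch (the table, columns, and `IsActive` flag are hypothetical):

```sql
CREATE VIEW dbo.vw_ActiveCustomers
AS
SELECT CustomerID, Email           -- sensitive columns can simply be omitted
FROM dbo.Customers
WHERE IsActive = 1;
GO

-- Query the view like a table; the underlying SELECT runs on access.
SELECT * FROM dbo.vw_ActiveCustomers;
```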
Q12:

What is a Stored Procedure and what is it used for?

Entry

Answer

A stored procedure is a precompiled SQL code block used for business logic, improving performance and reducing network overhead.

Quick Summary: A stored procedure is pre-compiled SQL code saved in the database. You call it by name with parameters. It runs faster than ad-hoc SQL because the execution plan is cached, reduces network traffic, centralizes business logic, and is easier to secure with permissions.
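A sketch of a parameterized stored procedure (names are hypothetical):

```sql
CREATE PROCEDURE dbo.usp_GetOrdersByCustomer
    @CustomerID INT
AS
BEGIN
    SET NOCOUNT ON;   -- suppress row-count messages to cut network chatter
    SELECT OrderID, OrderDate, Total
    FROM dbo.Orders
    WHERE CustomerID = @CustomerID;
END
GO

EXEC dbo.usp_GetOrdersByCustomer @CustomerID = 42;
```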
Q13:

What are User-Defined Functions (UDFs)?

Entry

Answer

UDFs return calculated values based on parameters. Used for reusable logic, validation, and transformations.

Quick Summary: UDFs are reusable functions you define in T-SQL. Scalar functions return one value. Inline table-valued functions return a table (like a parameterized view). Multi-statement TVFs build a table row by row. Scalar UDFs are handy but can tank performance when called per row in large queries.
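Sketches of the two most common UDF shapes (function and table names are hypothetical):

```sql
-- Scalar UDF: returns a single value.
CREATE FUNCTION dbo.fn_FullName (@First NVARCHAR(50), @Last NVARCHAR(50))
RETURNS NVARCHAR(101)
AS
BEGIN
    RETURN @First + N' ' + @Last;
END
GO

-- Inline table-valued function: behaves like a parameterized view.
CREATE FUNCTION dbo.fn_OrdersSince (@Cutoff DATE)
RETURNS TABLE
AS
RETURN (SELECT OrderID, OrderDate FROM dbo.Orders WHERE OrderDate >= @Cutoff);
```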
Q14:

What is a Transaction and why is it important?

Entry

Answer

A transaction ensures a group of SQL operations is treated as a single unit—either all succeed or all fail (ACID compliance).

Quick Summary: A transaction is a group of SQL operations treated as one unit. Either all succeed and commit, or all fail and roll back. This prevents half-done operations from corrupting your data — like a bank transfer where money leaves one account but never arrives in another.
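The bank-transfer example above can be sketched as a transaction with error handling (the `dbo.Accounts` table is hypothetical):

```sql
BEGIN TRY
    BEGIN TRANSACTION;

    UPDATE dbo.Accounts SET Balance = Balance - 100 WHERE AccountID = 1;
    UPDATE dbo.Accounts SET Balance = Balance + 100 WHERE AccountID = 2;

    COMMIT TRANSACTION;        -- both updates take effect together
END TRY
BEGIN CATCH
    IF @@TRANCOUNT > 0
        ROLLBACK TRANSACTION;  -- ...or neither is applied
    THROW;
END CATCH;
```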
Q15:

Explain ACID properties in SQL.

Entry

Answer

Atomicity: All-or-nothing.
Consistency: Valid states only.
Isolation: No interference between transactions.
Durability: Data persists after commit.

Quick Summary: ACID stands for: Atomicity (all or nothing), Consistency (data stays valid before and after), Isolation (transactions don't interfere with each other), Durability (once committed, data survives crashes). These four guarantees make transactions reliable in production systems.
Q16:

What is a Deadlock in SQL Server?

Entry

Answer

A deadlock occurs when transactions block each other while waiting for resources. SQL Server resolves it by killing one transaction.

Quick Summary: A deadlock happens when two transactions each hold a lock the other needs — both wait forever. SQL Server's deadlock monitor detects the cycle and kills the cheaper transaction (the deadlock victim), rolling it back and returning an error. The other transaction then completes normally.
Q17:

What is Locking in MS SQL Server?

Entry

Answer

Locking controls concurrent access using shared (read), exclusive (write), and update locks to maintain data consistency.

Quick Summary: Locking prevents concurrent transactions from corrupting shared data. SQL Server automatically applies locks (shared for reads, exclusive for writes) when a transaction touches data. The granularity ranges from row to page to table. Poor lock management causes blocking, contention, and deadlocks.
Q18:

What is a Schema in SQL?

Entry

Answer

A schema is a logical container grouping tables, views, and procedures for better organization and security.

Quick Summary: A schema is a logical namespace inside a database that groups related objects — tables, views, procedures. Like folders for your database objects. dbo is the default schema. Schemas help with organization, security (grant access per schema), and avoiding naming conflicts between teams.
Q19:

Difference between DELETE and TRUNCATE?

Entry

Answer

DELETE: Row-by-row removal, fully logged.
TRUNCATE: Deallocates pages, extremely fast, minimal logging.

Quick Summary: DELETE removes rows one at a time, logs each deletion, fires DELETE triggers, and can filter with a WHERE clause. TRUNCATE removes all rows at once, logs only page deallocations, is much faster, but doesn't fire DELETE triggers and can't be used on a table referenced by a foreign key. TRUNCATE resets identity columns; DELETE does not.
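Side by side, on hypothetical tables:

```sql
-- Row-by-row, fully logged, fires DELETE triggers, supports a filter:
DELETE FROM dbo.Orders WHERE OrderDate < '2020-01-01';

-- Deallocates pages, minimal logging, resets any IDENTITY column:
TRUNCATE TABLE dbo.StagingOrders;
```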
Q20:

What is an Execution Plan and why is it important?

Entry

Answer

An execution plan shows how SQL Server executes a query—joins, indexes, scans, seeks. It is vital for tuning slow queries.

Quick Summary: An execution plan is SQL Server's roadmap for running a query — which indexes to use, how to join tables, in what order to process operations. Reading the plan reveals bottlenecks like table scans, key lookups, or hash matches that slow things down. It's the first tool to reach for when tuning.
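A sketch of how to capture plans in T-SQL (in SSMS the same is available via "Display Estimated Execution Plan" and "Include Actual Execution Plan"):

```sql
-- Estimated plan: returned as XML, query is NOT executed.
SET SHOWPLAN_XML ON;
GO
SELECT * FROM dbo.Orders WHERE CustomerID = 42;
GO
SET SHOWPLAN_XML OFF;
GO

-- Actual plan: query runs, plan includes runtime statistics.
SET STATISTICS XML ON;
SELECT * FROM dbo.Orders WHERE CustomerID = 42;
SET STATISTICS XML OFF;
```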
Q21:

What is the main purpose of indexing in SQL Server?

Entry

Answer

Indexing improves data retrieval speed by avoiding full table scans.

Indexes act like optimized search maps that help SQL Server locate rows faster.

However, too many indexes slow down write performance due to maintenance overhead.

Quick Summary: Indexing lets SQL Server find rows without reading the entire table. Instead of scanning a million rows, it jumps to the right spot using the index structure (B-tree). Without good indexes, even simple lookups turn into full table scans that waste CPU, memory, and time.
Q22:

How does SQL Server decide between a Table Scan, Index Scan, and Index Seek?

Entry

Answer

SQL Server chooses access methods based on filters, available indexes, and cost estimates.

Table Scan: Reads all rows when no useful index exists.

Index Scan: Reads all index entries for broad filters.

Index Seek: Most efficient; jumps directly to matching rows.

Quick Summary: SQL Server uses statistics (row count estimates) to pick the cheapest plan. Table Scan reads every row — used when there's no useful index or the whole table is needed. Index Scan reads the entire index rather than a targeted range. Index Seek jumps directly to matching rows — the most efficient, what you want for selective queries.
Q23:

What is a Composite Index and when is it used?

Entry

Answer

A composite index includes multiple columns, used when queries filter or sort using those column combinations.

The order of columns is important because SQL Server can only efficiently use leading key columns.

Quick Summary: A composite index covers multiple columns. Useful when your WHERE clause filters on a combination of columns together. The column order matters — the index is most effective when the leading column appears in the filter. A (LastName, FirstName) index won't help a query filtering only on FirstName.
Q24:

What is Bookmark Lookup and why can it cause performance problems?

Entry

Answer

Bookmark lookup occurs when SQL uses a non-clustered index but must fetch extra columns from the base table.

This becomes slow when many rows match, increasing I/O and reducing performance.

Quick Summary: Bookmark lookup happens when a non-clustered index finds the matching row keys, but the query needs columns not in the index — so SQL Server does an extra lookup into the clustered index to get them. Each lookup is a random I/O. With many rows, this becomes very expensive. Fix: use a covering index.
Q25:

What is a Covering Index and why is it powerful?

Entry

Answer

A covering index contains all columns needed for a query.

SQL Server can return results from the index alone without touching the table.

This greatly improves performance for read-heavy queries.

Quick Summary: A covering index includes all the columns a query needs — the index columns plus extra INCLUDE columns. SQL Server can answer the query entirely from the index without touching the main table. Eliminates key lookups, reduces I/O, and often cuts query time dramatically.
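A sketch of a covering index using INCLUDE (table and column names are hypothetical):

```sql
-- CustomerID is the seek key; OrderDate and Total ride along in the leaf
-- pages so the query never touches the base table.
CREATE NONCLUSTERED INDEX IX_Orders_CustomerID_Covering
ON dbo.Orders (CustomerID)
INCLUDE (OrderDate, Total);

-- Answered entirely from the index above — no key lookup:
SELECT OrderDate, Total FROM dbo.Orders WHERE CustomerID = 42;
```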
Q26:

What are SQL Server Statistics and why are they essential?

Entry

Answer

Statistics describe data distribution in columns.

They help the optimizer estimate row counts and choose efficient execution plans.

Quick Summary: Statistics are histograms that tell SQL Server how data is distributed in a column — how many distinct values exist, how they're spread, what the min/max are. The query optimizer uses them to estimate how many rows a filter will return and pick the best execution plan.
Q27:

How do outdated or missing statistics affect query performance?

Entry

Answer

Outdated statistics produce inaccurate row estimates.

SQL Server may choose inefficient join types or scans, causing slow performance.

Regular statistic updates maintain optimal plans.

Quick Summary: Outdated statistics mean SQL Server is working with stale data distribution information. It might think a filter returns 100 rows when it actually returns 100,000 — leading to a plan built for small data that falls apart at scale. This causes unexpected table scans, wrong join types, and slow queries.
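A sketch of the standard maintenance commands (object and index names are hypothetical):

```sql
-- Refresh statistics for one table:
UPDATE STATISTICS dbo.Orders;

-- Refresh statistics database-wide:
EXEC sp_updatestats;

-- Inspect the histogram behind an index's statistics:
DBCC SHOW_STATISTICS ('dbo.Orders', 'IX_Orders_CustomerID');
```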
Q28:

What is Parameter Sniffing and why does it occur?

Entry

Answer

SQL Server caches execution plans for reuse.

Parameter sniffing occurs when a cached plan works well for one parameter but poorly for another due to data skew.

Quick Summary: Parameter sniffing is when SQL Server compiles a stored procedure plan based on the first parameter values it sees. If those values are atypical (e.g., a rare ID vs. a common status), the cached plan is optimal for that one call but terrible for everyone else. Leads to inconsistent query performance.
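One common mitigation, sketched as a hypothetical procedure:

```sql
CREATE PROCEDURE dbo.usp_OrdersForCustomer
    @CustomerID INT
AS
BEGIN
    -- OPTION (RECOMPILE): build a fresh plan for each call's actual value,
    -- trading compile cost for a plan that always fits the parameter.
    SELECT OrderID, OrderDate
    FROM dbo.Orders
    WHERE CustomerID = @CustomerID
    OPTION (RECOMPILE);

    -- Alternative: one plan tuned for a representative value:
    -- OPTION (OPTIMIZE FOR (@CustomerID = 42));
END
```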
Q29:

What is an Execution Plan and how does SQL Server generate it?

Entry

Answer

An execution plan is SQL Server’s strategy for running a query.

The optimizer evaluates multiple plan options and picks the lowest-cost one.

Quick Summary: When SQL Server receives a query, it parses it, checks the plan cache for a reusable plan, then the query optimizer evaluates multiple possible plans using cost estimates and picks the cheapest one. The plan is cached and reused. For parameterized queries, the same plan handles all values — good and bad.
Q30:

What is the difference between Estimated and Actual Execution Plans?

Entry

Answer

Estimated plans show predicted row counts before execution.

Actual plans include real runtime row counts and performance metrics.

Quick Summary: Estimated plan shows what SQL Server planned to do before running the query — based on statistics. Actual plan shows what really happened including actual row counts, execution counts, and time. Comparing estimated vs actual row counts reveals where statistics are wrong and the plan went off-course.
Q31:

What is Cardinality Estimation and why is it important?

Entry

Answer

Cardinality estimation predicts how many rows a query will process.

Accurate estimates are critical for choosing efficient join types and memory usage.

Quick Summary: Cardinality estimation predicts how many rows each operation in a plan will produce. If the estimate is 10 rows but the real result is 100,000, SQL Server builds the wrong plan — wrong join type, wrong memory grant, wrong index choice. Accurate statistics are what keep cardinality estimates correct.
Q32:

What are Index Fragmentation and Fill Factor?

Entry

Answer

Fragmentation occurs when index pages become disordered, slowing scans and seeks.

Fill factor controls how much free space to leave during index creation to reduce fragmentation.

Quick Summary: Fragmentation happens when index pages get out of order after many inserts, updates, and deletes. Logical fragmentation means pages are out of sequence — sequential reads become random I/O. Fill Factor controls how full each page is when the index is built, leaving room to grow and reducing future splits.
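A sketch of checking and fixing fragmentation (index and table names are hypothetical; the 90% fill factor is an illustrative choice):

```sql
-- Measure fragmentation for one table's indexes:
SELECT object_id, index_id, avg_fragmentation_in_percent
FROM sys.dm_db_index_physical_stats(
         DB_ID(), OBJECT_ID('dbo.Orders'), NULL, NULL, 'LIMITED');

-- Light fragmentation: reorganize in place.
ALTER INDEX IX_Orders_CustomerID ON dbo.Orders REORGANIZE;

-- Heavy fragmentation: rebuild, leaving 10% free space per page.
ALTER INDEX IX_Orders_CustomerID ON dbo.Orders REBUILD WITH (FILLFACTOR = 90);
```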
Q33:

What is the Query Optimizer and how does it work?

Entry

Answer

The optimizer evaluates many query strategies using statistics and metadata.

It selects the lowest estimated cost plan.

Quick Summary: The query optimizer is SQL Server's brain for plan selection. It generates multiple candidate plans, estimates the cost of each using statistics and cardinality, and picks the one with the lowest estimated cost. It's cost-based, not rule-based — which is why bad statistics lead directly to bad plans.
Q34:

What is a Filtered Index and when is it beneficial?

Entry

Answer

A filtered index stores only rows matching a condition.

It reduces index size and improves seek performance for selective queries.

Quick Summary: A filtered index only indexes rows that match a WHERE condition. If 90% of your rows are "archived" and queries only touch "active" ones, a filtered index on IsActive = 1 is tiny, fast, and highly selective. Much cheaper than a full index and often a perfect fit for status or soft-delete columns.
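The soft-delete example above can be sketched as (names are hypothetical):

```sql
-- Indexes only the active rows; archived rows cost nothing here.
CREATE NONCLUSTERED INDEX IX_Orders_Active
ON dbo.Orders (CustomerID)
WHERE IsActive = 1;
```

Queries benefit when their WHERE clause matches the filter (here, `IsActive = 1`); otherwise the optimizer can't use the filtered index.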
Q35:

What is a Hint and when should it be used?

Entry

Answer

Hints override optimizer decisions by forcing specific behavior.

They should be used only when the optimizer consistently chooses suboptimal plans.

Quick Summary: A hint forces SQL Server to use a specific index, join type, or behavior instead of letting the optimizer decide. Use sparingly — hints override the optimizer's judgment, which can help in edge cases but often makes things worse long-term as data changes. Prefer fixing statistics or schema over using hints.
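Two common hint shapes, sketched against hypothetical tables:

```sql
-- Table hint: force a specific index.
SELECT OrderID
FROM dbo.Orders WITH (INDEX (IX_Orders_CustomerID))
WHERE CustomerID = 42;

-- Query hint: force a join strategy.
SELECT c.Email, o.OrderID
FROM dbo.Customers AS c
JOIN dbo.Orders AS o ON o.CustomerID = c.CustomerID
OPTION (HASH JOIN);
```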
Q36:

What is a Hotspot in indexing terms?

Entry

Answer

A hotspot occurs when many concurrent operations target the same physical index location.

This causes locking and contention, reducing performance.

Quick Summary: A hotspot is a single index page that many concurrent inserts target at the same time — typically the last page of a sequential (identity-based) clustered index. All writers compete for the same page latch, causing contention and blocking. Mitigations: a non-sequential key (e.g., a GUID), partitioning, or the OPTIMIZE_FOR_SEQUENTIAL_KEY index option in SQL Server 2019+.
Q37:

What is the Role of the Query Store in SQL Server?

Entry

Answer

Query Store tracks query history, execution plans, and performance trends.

It helps diagnose regressions and enforce stable plans.

Quick Summary: Query Store is a built-in SQL Server feature that captures query text, execution plans, and runtime stats over time. When a query suddenly gets slower (plan regression), Query Store can show you the old fast plan and let you force it back. Essential for diagnosing intermittent performance problems.
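A sketch of enabling Query Store and forcing a plan (the database name and the `query_id`/`plan_id` values are hypothetical — you'd read the real IDs from the Query Store views or reports):

```sql
-- Turn Query Store on for a database:
ALTER DATABASE MyDb SET QUERY_STORE = ON;

-- After identifying a regressed query, pin its known-good plan:
EXEC sp_query_store_force_plan @query_id = 42, @plan_id = 7;
```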
Q38:

What is an Index Seek Predicate vs Residual Predicate?

Entry

Answer

Seek predicates allow precise navigation through an index.

Residual predicates apply filters after the seek when the index does not fully cover the query.

Quick Summary: Seek predicate is the condition used to enter the index — it actually narrows which rows are retrieved. Residual predicate is a secondary filter applied after the seek, on rows already found. A residual predicate means SQL Server fetched more rows than needed and filtered them post-retrieval — less efficient.
Q39:

Why does SQL Server choose a Hash Match instead of Nested Loop or Merge Join?

Entry

Answer

SQL Server uses hash match for large, unsorted datasets where hashing is cheaper than repeated seeks.

Poor row estimates may also cause SQL to choose hashing, sometimes leading to spills.

Quick Summary: SQL Server chooses Hash Match when it can't sort both inputs efficiently (no useful index, large unsorted data). It builds a hash table from the smaller input and probes it with the larger. It's memory-intensive and can spill to TempDB if the memory grant was underestimated. More common with missing or outdated indexes.
