Junior-Level PostgreSQL Interview Questions

Q1:

What is the PostgreSQL psql tool used for?

Junior

Answer

psql is the interactive command-line tool for connecting to PostgreSQL, executing SQL queries, running scripts, and viewing metadata.

Quick Summary: psql is PostgreSQL's interactive command-line client. Connect: psql -h host -U user -d dbname. Run SQL interactively or from files (-f file.sql). Meta-commands: \l (list databases), \dt (list tables), \d tablename (describe table), \i file.sql (run file), \timing (show query time), \copy for CSV import/export. Essential tool for DBA work and debugging.

Permalink

Q2:

What is the purpose of the SERIAL data type?

Junior

Answer

SERIAL creates an auto-incrementing integer column using sequences that automatically generate unique values.

Quick Summary: SERIAL is a shorthand for creating an auto-incrementing integer column. SERIAL creates an INTEGER with a sequence. BIGSERIAL for BIGINT. Modern PostgreSQL prefers: id BIGINT GENERATED ALWAYS AS IDENTITY which is SQL-standard. Both create a sequence that provides the next value on each insert. Use BIGINT/BIGSERIAL for tables that might grow large to avoid integer overflow.

Permalink

Q3:

What is the difference between TEXT and VARCHAR?

Junior

Answer

TEXT stores unlimited variable-length text, whereas VARCHAR can enforce a maximum length. Performance difference is minimal.

Quick Summary: TEXT and VARCHAR are functionally identical in PostgreSQL - both store unlimited length strings with the same performance. VARCHAR(n) enforces a maximum length (inserts longer strings fail). TEXT has no length limit. Unlike other databases, PostgreSQL TEXT is not slower than VARCHAR. Use TEXT unless you have a specific business reason to enforce a maximum length.

Permalink

Q4:

What are ENUM types used for?

Junior

Answer

ENUM restricts a column to predefined string values, useful for static categories such as status fields.

Quick Summary: ENUM types define a static set of allowed string values: CREATE TYPE mood AS ENUM ("happy", "sad", "neutral"). Use as column type: mood mood NOT NULL. Storage is efficient (1-4 bytes). Values have a defined order (useful for BETWEEN, ORDER BY). Downside: adding new values requires ALTER TYPE - needs exclusive lock in older PostgreSQL (non-blocking in newer versions).

Permalink

Q5:

What is a CHECK constraint?

Junior

Answer

CHECK ensures column values follow specific rules, enforcing data integrity at the database level.

Quick Summary: CHECK constraint enforces a condition on column values: age INTEGER CHECK (age >= 0 AND age <= 150). Table-level CHECK for multi-column conditions: CHECK (end_date > start_date). Fails silently for NULL values (NULL is not checked - it passes). PostgreSQL validates CHECK on INSERT and UPDATE. NOT NULL is a special CHECK that prevents NULL. Use for business rule enforcement.

Permalink

Q6:

What is the purpose of the RETURNING clause in PostgreSQL?

Junior

Answer

RETURNING returns columns from rows affected by INSERT, UPDATE, or DELETE, useful for retrieving generated IDs.

Quick Summary: RETURNING clause returns values from rows affected by INSERT, UPDATE, or DELETE - like a SELECT on modified rows. INSERT ... RETURNING id gets the auto-generated ID. UPDATE ... RETURNING new_value gets updated values. DELETE ... RETURNING * shows what was deleted. Eliminates a second SELECT query. More efficient and atomic than insert-then-select.

Permalink

Q7:

What is the difference between INNER JOIN and FULL OUTER JOIN?

Junior

Answer

INNER JOIN returns only matching rows, while FULL OUTER JOIN returns all rows from both tables with NULLs for non-matching data.

Quick Summary: INNER JOIN returns only rows with matching values in both tables (intersection). LEFT JOIN returns all rows from the left table plus matched rows from the right (unmatched right rows are NULL). RIGHT JOIN is the reverse. FULL OUTER JOIN returns all rows from both tables - matched where possible, NULL where not. Most commonly used: INNER JOIN for related data, LEFT JOIN to include optional relations.

Permalink

Q8:

What is a sequence in PostgreSQL?

Junior

Answer

A sequence is an independent object that generates incremental numeric values, commonly used for primary keys.

Quick Summary: A sequence is an independent database object that generates unique integer values in order. Used to auto-increment primary keys. CREATE SEQUENCE seq_name. Get next value: nextval("seq_name"). Current value: currval(). SERIAL/IDENTITY columns use sequences internally. Sequences are not transaction-aware - a rolled-back transaction still advances the sequence, creating gaps (this is expected and fine).

Permalink

Q9:

What is the purpose of the COALESCE function?

Junior

Answer

COALESCE returns the first non-NULL value from a list, useful for handling missing data.

Quick Summary: COALESCE returns the first non-NULL value from a list of arguments. COALESCE(col, "default") returns col if not NULL, else "default". Useful for: providing fallback values in queries, replacing NULLs for display. COALESCE(a, b, c) returns first non-NULL of a, b, c. Related: NULLIF(a, b) returns NULL if a equals b. Both are null-handling utilities used in SELECT columns and WHERE conditions.

Permalink

Q10:

What are indexes in PostgreSQL and why are they important?

Junior

Answer

Indexes speed up reads by creating fast lookup structures. PostgreSQL supports B-tree, Hash, GIN, GiST, SP-GiST, and BRIN.

Quick Summary: Indexes speed up data retrieval by creating a sorted data structure (B-tree by default) on one or more columns. Without an index, PostgreSQL scans every row (seq scan). With an index, it jumps directly to matching rows (index scan). Tradeoff: faster reads but slower writes (index updated on every INSERT/UPDATE/DELETE). Index columns frequently used in WHERE, JOIN, and ORDER BY.

Permalink

Q11:

What is a partial index?

Junior

Answer

A partial index applies only to rows meeting a condition, reducing storage and improving performance.

Quick Summary: Partial index indexes only rows matching a WHERE condition. Example: CREATE INDEX ON orders (user_id) WHERE status = "active". Smaller than a full index (only active orders indexed), faster to maintain, and PostgreSQL can only use it when the query matches the WHERE condition. Use when most queries target a subset of rows (active records, recent data, non-null values).

Permalink

Q12:

What is a composite index?

Junior

Answer

A composite index includes multiple columns and is used left to right, making column order important for optimization.

Quick Summary: Composite index covers multiple columns: CREATE INDEX ON orders (user_id, created_at). Useful when queries filter on multiple columns together. Index prefix rule: a composite index on (a, b, c) can be used for queries on (a), (a, b), or (a, b, c) but NOT (b) alone. Put the most selective and most frequently filtered column first.

Permalink

Q13:

What is a materialized view?

Junior

Answer

A materialized view stores query results physically and must be refreshed manually, improving performance for expensive queries.

Quick Summary: Materialized view stores the result of a query on disk - like a cached table. Faster to query than a regular view (no re-execution of the underlying query). Requires manual refresh: REFRESH MATERIALIZED VIEW viewname. REFRESH CONCURRENTLY allows reads during refresh (requires unique index). Use for expensive aggregate queries or reports that are expensive to recompute on every access.

Permalink

Q14:

What is the difference between DELETE and TRUNCATE in PostgreSQL?

Junior

Answer

DELETE removes rows with full logging. TRUNCATE removes all rows instantly, resets identities, and is minimally logged.

Quick Summary: DELETE: removes matching rows, logged in WAL (slower for large deletions), fires triggers, can use WHERE, transactional. TRUNCATE: removes all rows very fast (doesn't scan rows), minimal WAL logging, fires truncate triggers, can't use WHERE. Both are transactional in PostgreSQL (unlike MySQL MyISAM). TRUNCATE is the right choice when clearing entire tables for ETL or test data cleanup.

Permalink

Q15:

What is the VACUUM operation in PostgreSQL?

Junior

Answer

VACUUM removes dead tuples, reclaims storage, improves performance, and prevents transaction ID wraparound.

Quick Summary: VACUUM reclaims storage from dead tuples created by PostgreSQL's MVCC - rows that were deleted or updated but not yet physically removed. Dead tuples occupy space and slow down queries. VACUUM marks space as reusable (doesn't shrink file). VACUUM ANALYZE also updates query planner statistics. Autovacuum runs automatically. Run manually if autovacuum is falling behind on high-traffic tables.

Permalink

Q16:

What is the ANALYZE command used for?

Junior

Answer

ANALYZE collects table statistics used by the query optimizer to choose efficient execution plans.

Quick Summary: ANALYZE collects statistics about table column value distribution and stores them in pg_statistic. The query planner uses these statistics to estimate row counts and choose the best execution plan. Stale statistics lead to bad plan choices (wrong join order, wrong index). Run ANALYZE after bulk data loads or large changes. Autovacuum runs ANALYZE automatically when tables change significantly.

Permalink

Q17:

What is the difference between a function and a stored procedure in PostgreSQL?

Junior

Answer

Functions return values and can be used in queries, while procedures do not return values and can manage transactions.

Quick Summary: Functions (CREATE FUNCTION) return a value and can be used in SELECT, WHERE, etc. They can't manage transactions. Stored procedures (CREATE PROCEDURE, PostgreSQL 11+) don't need to return a value and can COMMIT/ROLLBACK within their body - enabling transaction control inside procedures. Call procedures with CALL procedure_name(). Functions are more versatile for query use; procedures for transaction control.

Permalink

Q18:

What is PostgreSQL’s JSONB data type?

Junior

Answer

JSONB stores JSON in a binary format for fast searching and indexing, supporting advanced JSON operations.

Quick Summary: JSONB stores JSON as decomposed binary - faster to query than JSON (which stores as text). JSONB supports: indexing with GIN indexes, operators for key lookup (->), path extraction (#>), containment (@>), and key existence (?). Use JSONB when you need to query JSON fields. Use JSON only when you need to preserve key order or exact formatting. Most use cases should use JSONB.

Permalink

Q19:

What are foreign data wrappers (FDW)?

Junior

Answer

FDWs allow PostgreSQL to query external data sources like MySQL, MongoDB, APIs, or files as if they were local.

Quick Summary: Foreign Data Wrappers (FDW) allow PostgreSQL to query external data sources as if they were local tables. Connect to: other PostgreSQL databases (postgres_fdw), MySQL, CSV files, Redis, S3, REST APIs. Install extension, create a server, define user mapping, then create foreign tables. Useful for data federation and migrating data across systems without ETL pipelines.

Permalink

Q20:

What is the purpose of the EXPLAIN command?

Junior

Answer

EXPLAIN shows how a query will execute, including index usage, join types, and cost estimates.

Quick Summary: EXPLAIN shows the query execution plan PostgreSQL will use without actually running the query. EXPLAIN ANALYZE runs the query and shows actual vs estimated row counts and execution times. Look for: sequential scans on large tables (may need an index), hash joins vs nested loops, rows estimates vs actual rows (big gap = stale statistics). Essential for performance debugging.

Permalink

Get Pro for Free

Junior PostgreSQL Interview Questions

PostgreSQL Interview Questions & Answers

What you will learn from these PostgreSQL interview questions:

Questions

What is the PostgreSQL psql tool used for?

Answer

What is the purpose of the SERIAL data type?

Answer

What is the difference between TEXT and VARCHAR?

Answer

What are ENUM types used for?

Answer

What is a CHECK constraint?

Answer

What is the purpose of the RETURNING clause in PostgreSQL?

Answer

What is the difference between INNER JOIN and FULL OUTER JOIN?

Answer

What is a sequence in PostgreSQL?

Answer

What is the purpose of the COALESCE function?

Answer

What are indexes in PostgreSQL and why are they important?

Answer

What is a partial index?

Answer

What is a composite index?

Answer

What is a materialized view?

Answer

What is the difference between DELETE and TRUNCATE in PostgreSQL?

Answer

What is the VACUUM operation in PostgreSQL?

Answer

What is the ANALYZE command used for?

Answer

What is the difference between a function and a stored procedure in PostgreSQL?

Answer

What is PostgreSQL’s JSONB data type?

Answer

What are foreign data wrappers (FDW)?

Answer

What is the purpose of the EXPLAIN command?

Answer

Curated Sets for PostgreSQL

People Also Ask - Related PostgreSQL Questions