You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
tech-interview-handbook/contents/algorithms/hash-table.md

4.0 KiB

id title toc_max_heading_level
hash-table Hash Table 2

Introduction

A hash table (commonly referred to as hash map) a data structure that implements an associative array abstract data type, a structure that can map keys to values. A hash table uses a hash function on an element to compute an index, also called a hash code, into an array of buckets or slots, from which the desired value can be found. During lookup, the key is hashed and the resulting hash indicates where the corresponding value is stored.

Hashing is the most common example of a space-time tradeoff. Instead of linearly searching an array every time to determine if an element is present, which takes O(n) time, we can traverse the array once and hash all the elements into a hash table. Determining if the element is present is a simple matter of hashing the element and seeing if it exists in the hash table, which is O(1) on average.

In the case of hash collisions, there are a number of collision resolution techniques that can be used. You will unlikely be asked about details of collision resolution techniques in interviews,

  • Separate chaining - A linked list is used for each value, so that it stores all the collided items.
  • Open addressing - All entry records are stored in the bucket array itself. When a new entry has to be inserted, the buckets are examined, starting with the hashed-to slot and proceeding in some probe sequence, until an unoccupied slot is found

Learning resources

Implementations

Language API
C++ std::map
Java java.util.Map. Use java.util.HashMap or java.util.TreeMap (preferred)
Python dict
JavaScript Object or Map

Time complexity

Operation Big-O Note
Access N/A Accessing not possible as the hash code is not known
Search O(1)*
Insert O(1)*
Remove O(1)*

* This is the average case, but in interviews we only care about the average case for hash tables.

Sample questions

  • Describe an implementation of a least-used cache, and big-O notation of it.
  • A question involving an API's integration with hash map where the buckets of hash map are made up of linked lists.

import AlgorithmCourses from '../_courses/AlgorithmCourses.md'