Data structures

Educational level: this is a tertiary (university) resource.

Subject classification: this is an information technology resource.

Subject classification: this is a mathematics resource.

Completion status: this resource is ~25% complete.

A portrait of Fibonacci, after whom Fibonacci heaps are named. These heaps enable O(1) insert time, amortized O(lg n) deletemin time and amortized O(1) decreasekey time, making them asymptotically faster than the basic binary heap structure discussed in this course.

Data structures help you organize and process your data. There are many different ways of implementing them depending on available resources and whims of the programmer, but here are the general ideas behind them:

Readings

Choosing a data structure

The type of data structure you want to use will often be determined by how quickly you need to be able to do certain things to the data and how often, with compromises sometimes made for hardware or network restrictions.

Simple data structures

Stacks

A stack is a data structure that supports first-in-last-out access to elements, meaning the most recently added element is the first to be removed. Stacks have two main operations, namely Push() and Pop(). Push() adds an element to the top of the stack, while Pop() removes the element at the top of the stack. You can think of it as a stack of plates: you can 'push' additional items onto the stack of plates or 'pop' plates from the top of the stack.

Stacks are usually implemented through a linked list.

IIT Video: Stacks

/*needs pictures*/

Linked lists

Think of a linked list as a series of boxes(called nodes) in a row. Each piece of information (or set of information) is put into one box with a pointer to the next box. A doubly linked list is one that also has pointers that go back the other way to the previous box. They have head and tail pointers to help you keep track of where the beginning and end are, and usually at least one pointer that moves around the inside of the structure to point at a box to help you keep your place as you look for things.

/*needs pictures*/

For example, suppose you wanted to keep the names, addresses, and birthdays of your friends in a linked list. Each node would have one friend's name, address, and birthday in it, plus a pointer to the next in the list. If you want, the list can be sorted as you add friends to it, based on their name, address, or birthday in whatever way you want. If you know that some friends are more important to you and you don't want to go through the whole list to look for them every time, you can add in another variable for each person that can be used to set a sorting priority.

IIT Video: Queues and Linked Lists

Queues

A Queue is a data structure that provides first-in-first-out access to elements. The two basic operations are Enqueue() and Dequeue(). Enqueue() adds an element to the back of the Queue. Dequeue() removes an element from the front of the queue. Just like a line at the supermarket, a Queue only supports adding items to the back and removing them from the front. In addition, some implementations allow you to 'peek' at the item in front without removing it.

/*needs pictures*/

Dictionaries

IIT Video: Dictionaries

Hash tables

Trees

You can think of a tree structure as a linked list with more than one outgoing pointer per node. This way, it branches out and the ends are called leaves instead of tails. The top node is called the root and the branches, like a real tree, don't merge back together. Thus, the nodes all have one incoming pointer and zero or more outgoing pointers, depending on the type of tree, its location within it, and the set of data that is shaping the tree.

IIT Video: Trees

Binary Search Trees

A binary tree is a specific type of tree with 0, 1, or 2 child nodes.

Traversal techniques

There are three main ways to process data in a tree. Recursion is usually the simplest way to perform such a task, where "traverse left" and "traverse right" below are recursive functions calls with the left and right children, respectively.

Preorder: process node, traverse left, traverse right

Inorder: traverse left, process node, traverse right

Postorder: traverse left, traverse right, process node

For example, consider the following recursive function to display the elements in a tree:

Routine DisplayElements( Node )
   if Node = null then return
   DisplayElements( Node's left )           //recursive function call with left child
   DisplayNode( Node's value )             //display value at current node
   DisplayElements( Node's right )        //recursive function call with left child
End Routine

This is a simple inorder traversal.

IIT Video: Tree Walks / Traversals

AVL Trees

2-4 Trees

IIT Video: 2-4 Trees

Red-Black Trees

Disk-based data structures

IIT Video: Disk-Based Data Structures

Pattern matching

Data compression

IIT Video: Data Compression

Priority queues

Sorting

Graphs

IIT Video: Graphs

Breadth-First Search

Depth-First Search

Minimum Spanning Trees

Shortest Paths

Dynamic allocation

Dynamic allocation asks for memory as it is needed.

Lecture Notes

Data structures lecture notes from University of Maryland, College Park

Assignments

ADUni:

College of William and Mary:

IIT Delhi:

More:

Exams

MIT, College of William and Mary

Note that these exams will cover material outside the scope of this course.

Supplementary Links

A short overview of basic data structures: Video: An Introduction To Trees & Data Structures, Video: The Queue Data Structure
Video lecture series:
- Steven Skeina's algorithms and data structures course at Stony Brook University
- MIT algorithms and data structures
- UC Berkeley CS 61B: Data Structures Fall 2006, Fall 2008 (other years available on same site)
- Digita University algorithms (and data structures) course
- UNSW Data Structures and Algorithms (partial)

Readings

Choosing a data structure

Simple data structures

Stacks

Linked lists

Queues

Dictionaries

Hash tables

Trees

Binary Search Trees

Traversal techniques

AVL Trees

2-4 Trees

Red-Black Trees

Disk-based data structures

Pattern matching

Data compression

Priority queues

Sorting

Graphs

Breadth-First Search

Depth-First Search

Minimum Spanning Trees

Shortest Paths

Dynamic allocation

Lecture Notes

Assignments

Exams

Supplementary Links

See also