Table of Contents
Linked Lists
Introduction
PintOS contains a linked list data structure in lib/kernel/list.h that is used for many different purposes. This linked list implementation is different from most other linked list implementations you may have encountered, because it does not use any dynamic memory allocation.
/* List element. */
struct list_elem
{
struct list_elem *prev; /* Previous list element. */
struct list_elem *next; /* Next list element. */
};
/* List. */
struct list
{
struct list_elem head; /* List head. */
struct list_elem tail; /* List tail. */
};
In a PintOS linked list, each list element contains a struct list_elem, which contains the pointers to the next and previous element. Because the list elements themselves have enough space to hold the prev and next pointers, we don’t need to allocate any extra space to support our linked list. Here is an example of a linked list element which can hold an integer:
/* Integer linked list */
struct int_list_elem
{
int value;
struct list_elem elem;
};
Next, you must create a struct list to represent the whole list. Initialize it with list_init().
/* Declare and initialize a list */
struct list my_list;
list_init (&my_list);
Now, you can declare a list element and add it to the end of the list. Notice that the second argument of list_push_back() is the address of a struct list_elem, not the struct int_list_elem itself.
/* Declare a list element. */
struct int_list_elem three = {3, {NULL, NULL}};
/* Add it to the list */
list_push_back (&my_list, &three.elem);
We can use the list_entry() macro to convert a generic struct list_elem into our custom struct int_list_elem type. Then, we can grab the value attribute and print it out:
/* Fetch elements from the list */
struct list_elem *first_list_element = list_begin (&my_list);
struct int_list_elem *first_integer = list_entry (first_list_element,
struct int_list_elem,
elem);
printf("The first element is: %d\n", first_integer->value);
By storing the prev and next pointers inside the structs themselves, we can avoid creating new linked list element containers. However, this also means that a list_elem can only be part of one list a time. Additionally, our list should be homogeneous (it should only contain one type of element).
The list_entry() macro works by computing the offset of the elem field inside of struct int_list_elem. In our example, this offset is 4 bytes. To convert a pointer to a generic struct list_elem to a pointer to our custom struct int_list_elem, the list_entry() just needs to subtract 4 bytes! (It also casts the pointer, in order to satisfy the C type system.)
Linked lists have 2 sentinel elements: the head and tail elements of the struct list. These sentinel elements can be distinguished by their NULL pointer values. Make sure to distinguish between functions that return the first actual element of a list and functions that return the sentinel head element of the list.
There are also functions that sort a link list (using quicksort) and functions that insert an element into a sorted list. These functions require you to provide a list element comparison function (see lib/kernel/list.h for more details).
List API Documentation
This implementation of a doubly linked list does not require use of dynamically allocated memory for the list itself. Please note however, that the list elements most likely still need to be dynamically allocated using malloc and the memory has to be de-allocated at the end using free. Each structure that is a potential list element must embed a struct list_elem member. All of the list functions operate on these struct list_elem’s. The list_entry macro allows conversion from a struct list_elem back to a structure object that contains it.
For example, suppose there is a need for a list of struct foo. In this case, struct foo should contain a struct list_elem member, like so:
struct foo
{
struct list_elem elem;
int bar;
// ...other members...
};
It does not matter at what position inside the struct foo the struct list_elem is positioned. Then a list of struct foo can be be declared and initialized like so:
struct list foo_list;
list_init (&foo_list);
Our doubly linked lists have two header elements: the “head” just before the first element and the “tail” just after the last element. The prev link of the front header is NULL, as is the next link of the back header. Their other two links point toward each other via the interior elements of the list.
An empty list looks like this:
+------+ +------+
NULL <---| head |<--->| tail |---> NULL
+------+ +------+
A list with two elements in it looks like this:
+------+ +-------+ +-------+ +------+
NULL <---| head |<--->| 1 |<--->| 2 |<--->| tail |---> NULL
+------+ +-------+ +-------+ +------+
The symmetry of this arrangement eliminates lots of special cases in list processing. For example, take a look at the implementation of list_remove(): it takes only two pointer assignments and no conditionals. That’s a lot simpler than the code would be without header elements.
Iteration is a typical situation where it is necessary to convert from a struct list_elem back to its enclosing structure. Here’s an example using foo_list:
struct list_elem *e;
for (e = list_begin (&foo_list); e != list_end (&foo_list);
e = list_next (e))
{
struct foo *f = list_entry (e, struct foo, elem);
// ...do something with f...
}
Adding new elements to a list, traversing it, and deleting all elements from the list again looks like:
int main ()
{
// create and initialize list
struct list g_list;
list_init (&g_list);
// Create 10 nodes
struct foo *e;
for (size_t i = 0; i < 10; ++i)
{
e = malloc (sizeof (struct foo));
e->data = i;
list_push_back (&g_list, &e->elem);
}
// iterate over all nodes and print the stored data
struct list_elem *k;
for (k = list_begin (&g_list); k != list_end (&g_list); k = list_next (k))
{
struct foo *f = list_entry (k, struct foo, elem);
printf ("%d\n", f->data);
}
// free all memory associated with the list
for (k = list_begin (&list); k != list_end (&list); k = list_remove (k))
{
struct foo *f = list_entry (k, struct foo, elem);
free(f);
}
return 0;
}
Glossary of list terms
- “front”: The first element in a list. Undefined in an empty list. Returned by
list_front(). - “back”: The last element in a list. Undefined in an empty list. Returned by list_back().
- “tail”: The element figuratively just after the last element of a list. Well defined even in an empty list. Returned by
list_end(). Used as the end sentinel for an iteration from front to back. - “beginning”: In a non-empty list, the front. In an empty list, the tail. Returned by
list_begin(). Used as the starting point for an iteration from front to back. - “head”: The element figuratively just before the first element of a list. Well defined even in an empty list. Returned by
list_rend(). Used as the end sentinel for an iteration from back to front. - “reverse beginning”: In a non-empty list, the back. In an empty list, the head. Returned by
list_rbegin(). Used as the starting point for an iteration from back to front. - “interior element”: An element that is not the head or tail, that is, a real list element. An empty list does not have any interior elements.
List Functions
kernel/list.c implements several functions:
The list_entry macro is an important facility we must comprehend:
#define list_entry(LIST_ELEM, STRUCT, MEMBER) \
((STRUCT *) ((uint8_t *) &(LIST_ELEM)->next \
- offsetof (STRUCT, MEMBER.next)))
Converts pointer to list element
LIST_ELEMinto a pointer to the structure thatLIST_ELEMis embedded inside. Supply the name of the outer structureSTRUCTand the member nameMEMBERof the list element. See the previous section for an example.
void list_init (struct list *l);
Initializes
las an empty list.
struct list_elem *list_begin (struct list *l);
Returns the beginning of
l.
struct list_elem *list_next (struct list_elem *elem);
Returns the element after
elemin its list. Ifelemis the last element in its list, returns the list tail. Results are undefined ifelemis itself a list tail.
struct list_elem *list_end (struct list *l);
Returns
l’s tail.
struct list_elem *list_rbegin (struct list *l);
Returns
l’s reverse beginning, for iterating throughlin reverse order, from back to front.
struct list_elem *list_prev (struct list_elem *elem);
Returns the element before
elemin its list. Ifelemis the first element in its list, returns the list head. Results are undefined ifelemis itself a list head.
struct list_elem *list_rend (struct list *l);
Returns
l’s head.
list_rend()is often used in iterating through a list in reverse order, from back to front. Here’s typical usage, following the example from the top oflist.h:for (e = list_rbegin (&foo_list); e != list_rend (&foo_list); e = list_prev (e)) { struct foo *f = list_entry (e, struct foo, elem); // ...do something with f... }
struct list_elem *list_head (struct list *l);
Returns
l’s head
struct list_elem *list_tail (struct list *l);
Returns
l’s tail.
void list_insert (struct list_elem *before, struct list_elem *elem);
Inserts
elemjust beforebefore, which may be either an interior element or a tail. The latter case is equivalent tolist_push_back().
void list_splice (struct list_elem* before, struct list_elem* first, struct list_elem *last);
Removes elements
firstthoughlast(exclusive) from their current list, then inserts them just beforebefore, which may be either an interior element or a tail.
void list_push_front (struct list *l, struct list_elem *elem);
Inserts
elemat the beginning ofl, so that it becomes the front inl.
void list_push_back (struct list *l, struct list_elem *elem);
Inserts
elemat the end ofl, so that it becomes the back inl.
struct list_elem *list_remove (struct list_elem *elem);
Removes
elemfrom its list and returns the element that followed it. Undefined behavior ifelemis not in a list.
struct list_elem *list_pop_front (struct list *l);
Removes the front element from
land returns it. Undefined behavior iflis empty before removal.
struct list_elem *list_pop_back (struct list *l);
Removes the back element from
land returns it. Undefined behavior iflis empty before removal.
struct list_elem *list_front (struct list *l);
Returns the front element in
l. Undefined behavior iflis empty.
struct list_elem *list_back (struct list *l);
Returns the back element in
l. Undefined behavior iflis empty.
size_t list_size (struct list *l);
Returns the number of elements in
l. Runs in O(n) in the number of elements.
bool list_empty(struct list *l);
Returns true if
lis empty, false otherwise.
void list_reverse (struct list *l);
Reverses the order of
l.
typedef bool list_less_func (const struct list_elem *a, const struct list_elem *b, void *aux);
For sorting, we need to rewrite this function, eg:
bool thread_less_func(struct list_elem *a, struct list_elem *b, void *aux) { return list_entry(a,struct thread,elem)->priority > list_entry(b,struct thread,elem)->priority; }
void list_sort (struct list *l, list_less_func *less, void *aux);
Sorts
laccording tolessgiven auxiliary dataaux, using a natural iterative merge sort that runs inO(n lg n)time andO(1)space in the number of elements inl.
void list_insert_ordered (struct list *l, struct list_elem *elem, list_less_func *less, void *aux);
Inserts
elemin the proper position inl, which must be sorted according tolessgiven auxiliary dataaux. Runs inO(n)average case in the number of elements inl.
void list_unique (struct list *l, struct list *duplicates, list_less_func *less, void *aux);
Iterates through
land removes all but the first in each set of adjacent elements that are equal according tolessgiven auxiliary dataaux. Ifduplicatesis non-NULL, then the duplicated elements fromlare appended toduplicates.
struct list_elem *list_max (struct list *l, list_less_func *less, void *aux);
Returns the element in
lwith the largest value according tolessgiven auxiliary dataaux. If there is more than one maximum, returns the one that appears earlier in the list. If the list is empty, returns its tail.
struct list_elem *list_min (struct list *l, list_less_func *less, void *aux);
Returns the element in
lwith the smallest value according tolessgiven auxiliary dataaux. If there is more than one minimum, returns the one that appears earlier in the list. If the list is empty, returns its tail.