Fundamental Data Structures and Algorithms Concepts

Posted on Dec 8, 2025 in Computers

A stack is a linear data structure that follows the Last In First Out (LIFO) principle. This means that the last element added to the stack is the first one to be removed. A stack can be visualized like a stack of plates; you can only add or remove the top plate.

Key operations of a stack include:

1. Push: Adding an element to the top of the stack.
2. Pop: Removing the element from the top of the stack.
3. Peek (or Top): Viewing the element at the top of the stack without removing it.
4. IsEmpty: Checking if the stack is empty.

Applications of stacks include:

1. Function Call Management: Stacks are used in programming languages to manage function calls and returns, keeping track of active functions and their local variables.
2. Undo Mechanism: In applications like text editors, stacks are used to implement the undo feature, allowing users to revert to previous states.
3. Expression Evaluation: Stacks are used in parsing expressions, particularly in converting infix expressions to postfix or prefix notation and evaluating them.
4. Backtracking: Stacks are utilized in algorithms that require backtracking, such as solving mazes or puzzles, where you need to keep track of previous states.
5. Memory Management: Stacks are used in managing memory allocation for local variables in programming.

_______

A set is a collection of distinct elements, meaning that each element can occur only once. Sets are commonly used in mathematics and computer science to group unique items together. The main characteristic of a set is that it does not allow duplicate values.

For example, consider the set of fruits: {apple, banana, orange}. In this case, each fruit appears only once, and the order does not matter.

A multiset, on the other hand, allows for multiple occurrences of the same element. This means that elements can appear more than once in a multiset. Multisets are also known as bags.

For example, consider a multiset of fruits: {apple, apple, banana, orange}. Here, the element “apple” appears twice, while “banana” and “orange” appear only once.

To summarize:

– A set contains unique elements: {1, 2, 3}

– A multiset can contain duplicates: {1, 1, 2, 3}

Sets are useful for operations like union, intersection, and difference, while multisets can be useful in scenarios where the frequency of elements matters, such as counting items or managing inventory.

____

Merge sort is a divide-and-conquer algorithm that sorts an array or list by dividing it into smaller subarrays, sorting those subarrays, and then merging them back together. Here’s how it works step-by-step:

1. Divide: The array is recursively divided into two halves until each subarray contains a single element. A single-element array is considered sorted.

Conquer: The sorted subarrays are then merged back together in a sorted manner

Combine: The merging process continues until all subarrays are combined back into a single sorted array

Let’s go through an example to illustrate how merge sort works.

Suppose we have the following unsorted array: [38, 27, 43, 3, 9, 82, 10].

### Step 1: Divide
We start by dividing the array into two halves:

– Left half: [38, 27, 43]
– Right half: [3, 9, 82, 10]

We further divide the left half:

– Left half of left: [38]
– Right half of left: [27, 43]

Continuing to divide [27, 43]:

– Left: [27]
– Right: [43]

Now we have the left half completely divided: [38], [27], [43].

Next, we divide the right half:

– Left half of right: [3, 9]
– Right half of right: [82, 10]

Dividing [3, 9]:

– Left: [3]
– Right: [9]

And dividing [82, 10]:

– Left: [82]

– Right: [10]

Now we have all elements divided into single-element arrays.

### Step 2: Conquer (Merge)

Now we start merging the arrays back together in sorted order.

Merge [27] and [43]:

– Result: [27, 43]

Merge [38] and [27, 43]:

– Compare 38 with 27 (27 is smaller), then 38 with 43 (38 is smaller).

– Result: [27, 38, 43]

Merge [3] and [9]:

– Result: [3, 9]

Merge [82] and [10]:

– Result: [10, 82]

Now we have two sorted halves: [27, 38, 43] and [3, 9, 10, 82].

### Step 3: Combine

Finally, we merge the two sorted halves:

Merge [27, 38, 43] and [3, 9, 10, 82]:

– Start comparing the first elements of both arrays.

– 3 is the smallest, so it goes first.

– Then 9, then 10, then 27, then 38, then 43, and finally 82.

The final sorted array is: [3, 9, 10, 27, 38, 43, 82].

Here’s an algorithm to implement selection sort:

1. Start with an array of elements.
2. For each index ‘i’ from 0 to the length of the array minus 1:
– Assume the minimum value is at index ‘i’ (set min_index = i).
3. For each index ‘j’ from ‘i’ + 1 to the length of the array:
– If the element at index ‘j’ is less than the element at min_index, update min_index to ‘j’.
4. If min_index is not equal to ‘i’, swap the elements at index ‘i’ and min_index.
5. Repeat steps 2 to 4 until the entire array is sorted.
6. End the algorithm.

### Example of Selection Sort Algorithm

Let’s say we have an array: [64, 25, 12, 22, 11].

1. Start with the first element (64).
– Set min_index to 0 (64).
– Compare with the rest: 25, 12, 22, 11.
– The smallest is 11 at index 4, so swap 64 and 11.
– Array now: [11, 25, 12, 22, 64].

2. Move to the next element (25).
– Set min_index to 1 (25).
– Compare with 12, 22, 64.
– The smallest is 12 at index 2, so swap 25 and 12.
– Array now: [11, 12, 25, 22, 64].

3. Move to the next element (25).
– Set min_index to 2 (25).
– Compare with 22, 64.
– The smallest is 22 at index 3, so swap 25 and 22.
– Array now: [11, 12, 22, 25, 64].

4. Move to the next element (25).
– Set min_index to 3 (25).
– Compare with 64.
– No swap needed since 25 is smaller.
– Array remains: [11, 12, 22, 25, 64].

Finally, move to the last element (64). – No more elements to compare. – The array is now sorted

The final sorted array is: [11, 12, 22, 25, 64].

Balanced search trees offer several advantages:

1. Efficient Search Operations: Balanced search trees, such as AVL trees or Red-Black trees, maintain a height that is logarithmic relative to the number of nodes. This ensures that search operations can be performed in O(log n) time, making it efficient for looking up values.

2. Insertion and Deletion Efficiency: Like search operations, both insertion and deletion in balanced search trees also occur in O(log n) time. This is due to the tree‘s balanced nature, which keeps the height minimal.

3. Ordered Data Structure: Balanced search trees maintain the order of elements, which allows for in-order traversal to retrieve elements in sorted order. This is useful for applications that require sorted data.

4. Dynamic Data Structure: They can efficiently handle dynamic datasets where insertions and deletions occur frequently. The tree can adjust itself to maintain balance after such operations.

5. Space Efficiency: Balanced search trees use space efficiently, typically requiring space proportional to the number of nodes, which is O(n).

6. Support for Range Queries: They can efficiently support range queries, allowing you to find all elements within a certain range in O(log n + k) time, where k is the number of elements in the range.

Overall, balanced search trees provide a good combination of performance and functionality, making them a popular choice for many applications involving dynamic sets of data.

________

A directed graph and an undirected graph are two fundamental types of graphs used in graph theory.

Directed Graph (Digraph): In a directed graph, edges have a direction. This means that each edge connects two vertices (nodes) and indicates a one-way relationship from one vertex to another. For example, if there is a directed edge from vertex A to vertex B, it means you can go from A to B, but not necessarily from B to A.

*Example*: Consider a social media platform where users can follow each other. If user A follows user B, this can be represented as a directed edge from A to B. However, if user B does not follow user A back, there is no directed edge from B to A.

Undirected Graph: In an undirected graph, edges do not have a direction. This means that the relationship between the vertices is bidirectional, and you can traverse the edge in both directions. If there is an edge between vertex A and vertex B, you can go from A to B and from B to A.

*Example*: Think of a friendship network where if user A is friends with user B, then user B is also friends with user A. This can be represented as an undirected edge between A and B, indicating a mutual relationship.

To reverse a string, you can follow this simple algorithm:

1. Start with the input string.
2. Create an empty string to hold the reversed version.
3. Loop through the original string from the last character to the first character.
4. In each iteration, append the current character to the empty string.
5. After the loop, the empty string will contain the reversed string.
6. Return or print the reversed string.

Here’s a step-by-step example using the string “hello”:

1. Input string: “hello”
2. Initialize an empty string: reversed_string = “”
3. Loop through the string from the last character to the first:
– i = 4 (character ‘o’): reversed_string = “o”
– i = 3 (character ‘l’): reversed_string = “ol”
– i = 2 (character ‘l’): reversed_string = “oll”
– i = 1 (character ‘e’): reversed_string = “olle”
– i = 0 (character ‘h’): reversed_string = “olleh”
4. End of the loop.
5. The final reversed string is “olleh”.

So, the algorithm effectively reverses the input string. The final answer is “olleh” for the input “hello”.

Programming models are frameworks or paradigms that define how programs are structured and how they interact with data and processes. They help developers understand the flow of data and control, and they can significantly influence how software is designed and implemented. Here are some common programming models:

1. Procedural Programming: This model is based on the concept of procedure calls. Programs are structured as a sequence of instructions or procedures that operate on data. It emphasizes a linear flow of control and uses constructs like loops and conditionals. Languages like C and Pascal are examples of procedural programming.

2. Object-Oriented Programming (OOP): OOP is centered around the concept of “objects,” which can encapsulate data and behavior. It promotes code reusability through inheritance and polymorphism. Common principles include encapsulation, abstraction, inheritance, and polymorphism. Languages like Java, C++, and Python support OOP.

3. Functional Programming: This model treats computation as the evaluation of mathematical functions and avoids changing state or mutable data. It emphasizes the use of functions as first-class citizens and supports higher-order functions. Languages like Haskell, Scala, and even JavaScript (to some extent) support functional programming.

4. Declarative Programming: In this model, you express the logic of a computation without describing its control flow. You specify what you want to achieve rather than how to achieve it. SQL is a common example of a declarative language, as it allows you to query databases without specifying the procedural steps to retrieve data.

5. Event-Driven Programming: This model is based on the concept of events and event handlers. Programs respond to user actions or system events (like mouse clicks or key presses). It is widely used in graphical user interfaces (GUIs) and web applications. JavaScript in the context of web programming is a prime example of event-driven programming.

A symbol table is a data structure used by compilers and interpreters to store information about the symbols (identifiers) in a program. It plays a crucial role in the process of compiling or interpreting code. Here are the key aspects of a symbol table:

1. Purpose: The primary purpose of a symbol table is to keep track of various identifiers such as variable names, function names, class names, and their associated information like data types, scope, and memory locations.

2. Structure: Symbol tables can be implemented using different data structures, such as hash tables, binary search trees, or linked lists. The choice of structure affects the efficiency of operations like insertion, deletion, and lookup.

3. Scope Management: Symbol tables help manage variable scopes. For example, in nested functions or blocks, a new symbol table can be created for the inner scope, and it can reference the outer scope’s symbol table as needed.

Information Stored: A symbol table typically stores:

– Identifier Name: The name of the variable, function, or class.

– Data Type: The type of data the identifier holds (e.G., integer, float, string).

– Memory Location: The address in memory where the identifier’s value is stored.

– Scope Information: Information about where the identifier is valid (e.G., local, global).

5. Usage in Compilation: During the compilation process, the compiler uses the symbol table to check for semantic errors, such as type mismatches or undeclared variables. It also helps in generating the correct code by providing the necessary information about each identifier.

6. Lifetime: Symbol tables can have different lifetimes. For example, a symbol table for a function’s local variables exists only while the function is executing,

A minimum spanning tree (MST) algorithm is used to find a subset of the edges in a connected, undirected graph that connects all the vertices together without any cycles and with the minimum possible total edge weight. There are several algorithms to find the MST, with the two most common being Prim’s algorithm and Kruskal’s algorithm. Here’s a brief overview of both:

Prim’s Algorithm:

– Start with a single vertex and grow the minimum spanning tree one edge at a time.

– Initialize an empty tree and add the starting vertex to it.

– While there are still vertices not included in the tree:

– Find the edge with the smallest weight that connects a vertex in the tree to a vertex outside the tree.

– Add the selected edge and the new vertex to the tree.

– Repeat until all vertices are included.

Kruskal’s Algorithm:

– Start with a forest (a set of trees) where each vertex is its own tree.

– Create a list of all the edges in the graph and sort them by weight.

– Initialize an empty tree for the MST.

– Iterate through the sorted edge list:

– For each edge, if it connects two different trees (i.E., it does not form a cycle), add it to the MST and merge the two trees.

– Continue until there are $ V – 1 $ edges in the MST (where $ V $ is the number of vertices).

Both algorithms have their own advantages and use cases. Prim’s algorithm is generally more efficient for dense graphs, while Kruskal’s algorithm is more efficient for sparse graphs.

A regular expression (often abbreviated as regex or regexp) is a sequence of characters that forms a search pattern. It is mainly used for string matching within texts, allowing you to search, replace, or validate input based on specific patterns. Regular expressions are widely used in programming, text processing, and data validation.

Here are some key concepts and components of regular expressions:

1. Literals: These are the simplest form of regex, where you match the exact characters. For example, the regex “cat” matches the string “cat”.

2. Metacharacters: These characters have special meanings in regex, such as:
– `.`: Matches any single character except a newline.
– `^`: Matches the start of a string.
– `$`: Matches the end of a string.
– `*`: Matches zero or more occurrences of the preceding element.
– `+`: Matches one or more occurrences of the preceding element.
– `?`: Matches zero or one occurrence of the preceding element.

3. Character Classes: You can define a set of characters to match. For example, `[abc]` matches any single character that is either ‘a’, ‘b’, or ‘c’. Ranges can also be used, like `[a-z]`, which matches any lowercase letter.

4. Groups and Capturing: Parentheses `()` are used to group parts of the regex. For example, `(abc)+` matches one or more occurrences of the sequence “abc”.

5. Quantifiers: These specify how many times the preceding element should match. Common quantifiers include:
– `{n}`: Matches exactly n occurrences.
– `{n,}`: Matches n or more occurrences.
– `{n,m}`: Matches between n and m occurrences.

6. Anchors: These are used to specify positions in the string. For example, `\b` matches a word boundary, and `\B` matches a non-word boundary.