Conventional Indexes

Last updated Nov 8, 2022 Edit Source

# Conventional Indexes

Indexes are needed to reduce the I/O required to find a record.

# Updating Indexes

Locate the targeted record or the place to hold new record
Update data file
Update index

# Clustered and Non-Clustered Indexes

Clustering index: indexes on an attribute is such that all the tuples with a fixed value for the search key of this index appear on as few blocks as can hold them. If a relation is clustered (it must be sorted and packed together according to some attribute a) another index on another attribute b would likely be non-clustered unless a and b are highly correlated.

# Comparisons

# Read

A range read of keys that are close together will result in high number of I/O:

# Update

Clustered indexes will not be as good if the database goes through many update operations.

# Multi-layer Index

# Practice Problems

a. Dense: We need 300 key pointer pairs. Each block can hold 10 pairs. Total blocks = 300 / 10 = 30 Sparse: 1 index pointer can point to a block of 3 records. Each block can hold 10 pointers. 1 index block represents 30 records. $300/30=10$ blocks needed b. Worst case: retrieve the last record -> 10 I/O c. Another sparse index to point to a block of sparse index Since the initial sparse index needs 10 blocks to represent, the second level index can use 1 block (10 pointers) to fully represent it. I/O for 2nd level: 1 I/O for 1st level: 1 I/O to read record: 1 Total 3 I/O a. Best case when inserting a record in the not full block with record 9. Insert 10 1 I/O to read the index block, 1 I/O to load the block with record 9. Total 2 I/O b. Worst case when inserting into first data block. Insert 0. 1 I/O to read index block. Need to load every data block to shift records down. Total 1+4=5 I/O.

Brendan Ang