Draft:Klee's Algorithm

From Wikipedia, the free encyclopedia


Framed as an open research problem published in 1977 in The American Mathematical Monthly,[1] Victor Klee presented a challenge to the journal's readers to find an algorithm to compute the measure of the union of a set of intervals that would be more efficient than sorting the interval endpoints and sequentially adding the resulting lengths together. Klee then provided an algorithm in pseudocode that performs the sorting and summation as a baseline solution for this problem with a complexity of . This complexity was later proven to be optimal in 1978 by Michael Freman and Bruce Weide.[2]

As an additional challenge, Klee invited the readers to consider the problem in multiple dimensions and how efficiently someone could solve the complexity of finding a disjoint set of d-dimensional intervals, reducing that set to the minimum number of intervals possible, and computing the measure of their union. This last question was eventually labeled Klee's measure problem.

Algorithm[edit]

  1. Separate the endpoint pairs for each interval and tag if each endpoint was on the left or the right.
  2. Sort the individual endpoints in ascending order of value, with the additional condition that left endpoints are before right endpoints when two endpoint values are the same. (This changes any partially overlapping intervals into a series of nested ones.)
  3. Iterate through the sorted endpoints, keeping track of the interval nesting count and recording just the outermost interval of each nesting.
  4. Add up the measures of each of the resulting set of disjointed intervals.

Logical error in the original[edit]

The original pseudocode had the following lines inside of the loop through the endpoints:


if  then  and 
if  then 

where is the interval nesting count, is the current endpoint index, = +1 for left endpoints and -1 for right endpoints, is the current endpoint value, is the index of the current interval for the new set, and and are the start and end of the current interval for the new set, respectively. This code incorrectly jumps ahead at the right endpoint of the second outermost interval and uses that endpoint as the left endpoint of the next recorded interval.

To fix this problem, the pseudocode could be rewritten as follows:

if  then  and 

if  then 

This way, the first line only triggers before the first endpoint of a nested set of endpoints and the second line only triggers after the end of the last endpoint of such a set.

Javascript implementation[edit]

function measureOfUnion(intervals = []) {
    // the input is expected to be an array of intervals,
    // with each interval being a 2-element array
    const endpoints = [];
    for (let interval of intervals) {
        endpoints.push({v: interval[0], nestingChange: +1});
        endpoints.push({v: interval[1], nestingChange: -1});
    }
    endpoints.sort(endpointSort);
    const disjointIntervals = [];
    let nestingLevel = 0;
    let currentInterval;
    for (let endpoint of endpoints) {
        if (nestingLevel === 0) {
            currentInterval = [endpoint.v];
            disjointIntervals.push(currentInterval);
        }
        nestingLevel += endpoint.nestingChange;
        if (nestingLevel === 0)
            currentInterval.push(endpoint.v);
    }
    let measure = 0;
    for (let interval of disjointIntervals) {
        measure += interval[1] - interval[0];
    }
    return measure;
}

function endpointSort(a, b) {
    // normal expression for ascending order
    if (a.v != b.v) return a.v - b.v;
    // swap endpoints if the first one ends an interval
    if (a.nestingChange == -1) return +1;
    // otherwise, stay in this order
    return -1;
}

References[edit]

  1. ^ Klee, Victor (1977). "Can the measure of be computed in less than steps?". American Mathematical Monthly. 84 (4): 284–285. doi:10.2307/2318871. JSTOR 2318871. MR 0436661.
  2. ^ *Fredman, Michael L.; Weide, Bruce (1978). "The complexity of computing the measure of ". Communications of the ACM. 21 (7): 540–544. doi:10.1145/359545.359553. MR 0495193. S2CID 16493364.