Differences between revisions 9 and 10

Here is a brief description of the layout of the Objects in my Project and how the binary search partition is built.

Overview of the Process

At the bottom we have points that are either read in or generated randomly
The points are collected into cells and the cells make up a grid. Each cell has a density, and number of points associated with it.
The grid is put into one bucket whose lowerBound and upperBound are not points, but are designated by cells in the grid. Therefore the lowerBound will always be (0,0,0,0) for the first bucket and (X,X,X,X) where X is the last cell in each dimension.
From here on, we deal only with cells
A bucket is created to contain all the cells
The bucket is split into two buckets until the desired number of buckets is reached.

Defining the Cells

The question is what parameters decide the cell size/shape and grid size and shape? We determine this in the following way:

Find the lower and upper bound points in each dimesion lowerBound = (wl,xl,yl,zl) and upperBound = (wu,xu,yu,zu). This may be given by the random point generating function, but most often is determined as we read the points.
Given the resolution of the grid we can now determine the size/shape of the cells.

From bucket to buckets

Next the question is how is the first bucket made and how do we split the into more buckets.

Each bucket has:

lowerBound and upperBound cells
cell partition
number of points
skew
skew reduction

The first bucket is made up of all the cells in the Grid4D object. Bucket = lowerBound(0,0,0,0) and upperBound(X,X,X,X). Notice that these are cell numbers not actual positions.

When a bucket is created it is given cell bounds and the following are calculated:

Number of points
Skew

Note that we CANNOT CALCULATE THE:

partition point - because this creates buckets and that would start an endless loop.
skew reduction - because this requires the partition point.

Partitioning the Bucket

A bucket must be refreshed - that is we must calculate the partitionPoint and skewReduction by calling refresh();

The bucket then has a patitionCell. The partitionCell is the upper bound cell execept in the dimension it is split. When a bucket is split the lowerBucket and upperBucket are created as follows:

     lowerBucket = [lowerBoundCell, partitionCell]
     upperBucket = [lowerBoundCell except in the split dimension substitute partitionCell+1, upperBound]

Clearly than the calculation of the skew reduction must be done along this same split.

Running the Simulation

The following steps must be done in order:

Create a new DataGrid4D object called g
Determine the two query points: Point4D q1, q2
Set the number of buckets in the input DataGrid4D.setBuckets(int NumBuckets)

Here we start our actual contribution:

Build the parametric functions: DataGrid4D.buildParametricFunctions(Point4D q1,q2)
Run the MaxCount algorithm: BSP4D.MaxCount(float from, to)

Collecting statistics

For each run of the program we will collect in a comma seperated values list:

Number of buckets (50, 60, 70, 80, 90, 100)
Number of points (8K, 12K, 16K, 20K, 24K, 28K, 32K, 36K, 40K)
Distance between query points (400, 800, 1200, 1600, 2000)
Dense vs. Sparse regions (one each)

-  ⇤ ← Revision 9 as of 2005-02-23 18:04:39 → 
  Size: 3431
  Editor: yakko
  Comment:
+   ← Revision 10 as of 2005-02-23 18:19:21 → ⇥
  Size: 3541
  Editor: yakko
  Comment:
-Deletions are marked like this.
+Additions are marked like this.
 Line 69:
-. Number of buckets
   1. Number of points
   1. Distance between query points
   1. Dense vs. Sparse regions
+. Number of buckets (50, 60, 70, 80, 90, 100)
   1. Number of points (8K, 12K, 16K, 20K, 24K, 28K, 32K, 36K, 40K)
   1. Distance between query points (400, 800, 1200, 1600, 2000)
   1. Dense vs. Sparse regions (one each)