Method for refining the initial conditions for clustering with applications to small and large database clustering

Image Number 5 for United States Patent #6115708.

Viewed as an optimization problem, clustering data (unsupervised learning) is known to be difficult. Most practical approaches use a heuristic, typically a gradient-descent algorithm, to search for a solution in the huge space of possible solutions. Such methods are by definition sensitive to starting points, and clustering algorithms are well known to be extremely sensitive to initial conditions. Most methods for choosing an initial solution simply make random guesses. In this paper we present a method that takes an initial condition and efficiently produces a refined starting condition. The method is applicable to a wide class of clustering algorithms for discrete and continuous data. We demonstrate how it is applied to the popular K-means clustering algorithm and show that refined starting points do lead to improved solutions. The technique can also be used as an initializer for other clustering algorithms. The method is based on an efficient technique for estimating the modes of a distribution, and its running time is guaranteed to be less than the overall clustering time for large data sets. The method is also scalable and can therefore be used efficiently on huge databases to refine starting points for scalable clustering algorithms in data mining applications.
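
The abstract does not spell out the refinement steps, so the following is only a minimal sketch under stated assumptions: the initial centers are refined by running K-means on several small random subsamples, pooling the resulting centers, and then clustering that pool starting from each subsample solution, keeping the lowest-distortion result. The function names (`sq_dist`, `kmeans`, `refine_start`) and the parameters `n_subsamples`, `frac`, and `seed` are hypothetical and chosen for illustration; they are not taken from the patent text.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, centers, iters=50):
    """Plain Lloyd-style K-means. Returns (centers, distortion)."""
    centers = [tuple(c) for c in centers]
    for _ in range(iters):
        # Assign every point to its nearest center.
        groups = [[] for _ in centers]
        for p in points:
            j = min(range(len(centers)), key=lambda i: sq_dist(p, centers[i]))
            groups[j].append(p)
        # Recompute each center as the mean of its group;
        # an empty group keeps its previous center (an assumption).
        new_centers = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else c
            for g, c in zip(groups, centers)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    distortion = sum(min(sq_dist(p, c) for c in centers) for p in points)
    return centers, distortion


def refine_start(points, initial, n_subsamples=10, frac=0.1, seed=0):
    """Assumed refinement scheme: cluster small subsamples, then smooth."""
    rng = random.Random(seed)
    size = min(len(points), max(len(initial), int(frac * len(points))))
    # Step 1: cluster each small random subsample from the same initial guess.
    solutions = []
    for _ in range(n_subsamples):
        sample = rng.sample(points, size)
        centers, _ = kmeans(sample, initial)
        solutions.append(centers)
    # Step 2: pool all subsample centers into one small data set.
    pooled = [c for sol in solutions for c in sol]
    # Step 3: cluster the pool from each solution; keep the best fit.
    best = min((kmeans(pooled, sol) for sol in solutions), key=lambda r: r[1])
    return best[0]
```

A caller would pass a list of coordinate tuples and a rough initial guess, then hand the returned centers to the full-data clustering run, for example `kmeans(points, refine_start(points, initial))`. Because only small subsamples and the pooled centers are clustered here, the refinement touches far fewer points than a full clustering pass, which is in the spirit of the running-time claim above.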
