Resources Contact Us Home
Browse by: INVENTOR PATENT HOLDER PATENT NUMBER DATE
 
 
Method for refining the initial conditions for clustering with applications to small and large database clustering










Image Number 5 for United States Patent #6115708.

As an optimization problem, clustering data (unsupervised learning) is known to be a difficult problem. Most practical approaches use a heuristic, typically gradient-descent, algorithm to search for a solution in the huge space of possible solutions. Such methods are by definition sensitive to starting points. It has been well-known that clustering algorithms are extremely sensitive to initial conditions. Most methods for guessing an initial solution simply make random guesses. In this paper we present a method that takes an initial condition and efficiently produces a refined starting condition. The method is applicable to a wide class of clustering algorithms for discrete and continuous data. In this paper we demonstrate how this method is applied to the popular K-means clustering algorithm and show that refined initial starting points indeed lead to improved solutions. The technique can be used as an initializer for other clustering solutions. The method is based on an efficient technique for estimating the modes of a distribution and runs in time guaranteed to be less than overall clustering time for large data sets. The method is also scalable and hence can be efficiently used on huge databases to refine starting points for scalable clustering algorithms in data mining applications.








 
 
  Recently Added Patents
LCD television set capable of external connection with application processor
Composite aircraft floor system
Data paths using a first signal to capture data and a second signal to output data and methods for providing data
Panel for decoration
Machine shop including computer system that interfaces with different legacy servers
Contact detection between a disk and magnetic head
Client network device and method for adjusting parameters of client traffic windows based on discovery of other network devices
  Randomly Featured Patents
Sewing machine
Enhancing flame retardancy with organobromosilicone fluids
Sensor using fiber interferometer
Vehicle seat position adjusting device
Thermal conductivity measuring method and apparatus, and gas component ratio measuring apparatus
Streamers and bubbles
Solar radiation collector
Multi-grained molding
Salicylic acid derivatives with fluorophores and method of making and using the same
Welding torch lighter