Resources Contact Us Home
Method for refining the initial conditions for clustering with applications to small and large database clustering

Image Number 5 for United States Patent #6115708.

As an optimization problem, clustering data (unsupervised learning) is known to be a difficult problem. Most practical approaches use a heuristic, typically gradient-descent, algorithm to search for a solution in the huge space of possible solutions. Such methods are by definition sensitive to starting points. It has been well-known that clustering algorithms are extremely sensitive to initial conditions. Most methods for guessing an initial solution simply make random guesses. In this paper we present a method that takes an initial condition and efficiently produces a refined starting condition. The method is applicable to a wide class of clustering algorithms for discrete and continuous data. In this paper we demonstrate how this method is applied to the popular K-means clustering algorithm and show that refined initial starting points indeed lead to improved solutions. The technique can be used as an initializer for other clustering solutions. The method is based on an efficient technique for estimating the modes of a distribution and runs in time guaranteed to be less than overall clustering time for large data sets. The method is also scalable and hence can be efficiently used on huge databases to refine starting points for scalable clustering algorithms in data mining applications.

  Recently Added Patents
Soliciting first party in communication session to maintain call when degradation of connection to second party is anticipated
Method for editing a media clip in a mobile terminal device, a terminal device utilizing the method and program means for implementing the method
System for alarming upon occurrence of secession of smart key for vehicles
Intrinsic absorber layer for photovoltaic cells
Remote device pairing setup
Conductor-mixed active electrode material, electrode structure, rechargeable battery, and manufacturing method of conductor-mixed active electrode material
Electronic device
  Randomly Featured Patents
Determination of mask fitting pressure and correct mask fit
Method and system for querying and executing commands of an application program
Lighting and shadowing methods and arrangements for use in computer graphic simulations
System and method for providing road information in advance
Projection television display device
Method of preparing an otoplasty or adaptive earpiece individually matched to the shape of an auditory canal
Biaxially stretched bottle having a carrying handle
Engine speed control cable clamp
Apparatus for measuring fluorescent spectra of particles in a flow
Content addressable memory with concurrent read and search/compare operations at the same memory cell