Method for refining the initial conditions for clustering with applications to small and large database clustering

United States Patent #6115708

As an optimization problem, clustering data (unsupervised learning) is known to be difficult. Most practical approaches use a heuristic, typically a gradient-descent algorithm, to search for a solution in the huge space of possible solutions. Such methods are by definition sensitive to their starting points, and it is well known that clustering algorithms are extremely sensitive to initial conditions. Most methods for guessing an initial solution simply make random guesses. In this paper we present a method that takes an initial condition and efficiently produces a refined starting condition. The method is applicable to a wide class of clustering algorithms for discrete and continuous data. We demonstrate how it is applied to the popular K-means clustering algorithm and show that refined starting points indeed lead to improved solutions. The technique can also be used as an initializer for other clustering algorithms. The method is based on an efficient technique for estimating the modes of a distribution and runs in time guaranteed to be less than the overall clustering time for large data sets. It is also scalable and hence can be used efficiently on huge databases to refine starting points for scalable clustering algorithms in data mining applications.
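
The abstract does not spell out the refinement procedure, so the following is only a minimal sketch under a stated assumption: that refinement works by running K-means on several small random subsamples from the given initial condition, pooling the resulting centers, clustering that pool once per subsample solution, and keeping the candidate with the lowest distortion over the pool. The function names (`kmeans`, `distortion`, `refine`) and the parameters `n_subsamples` and `frac` are illustrative choices, not taken from the patent.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, centers, iters=50):
    """Plain Lloyd-style K-means started from the given centers."""
    centers = [tuple(c) for c in centers]
    for _ in range(iters):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: sq_dist(p, centers[i]))
            groups[nearest].append(p)
        # A center with no assigned points is left where it is (a simplification).
        new = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else c
            for g, c in zip(groups, centers)
        ]
        if new == centers:
            break
        centers = new
    return centers


def distortion(points, centers):
    """Sum of squared distances from each point to its nearest center."""
    return sum(min(sq_dist(p, c) for c in centers) for p in points)


def refine(points, init, n_subsamples=10, frac=0.1, seed=0):
    """Assumed refinement scheme: cluster subsamples, then cluster the pooled centers."""
    rng = random.Random(seed)
    size = max(len(init), int(frac * len(points)))
    # Step 1: run K-means on each small subsample, always from the same initial condition.
    solutions = [kmeans(rng.sample(points, size), init) for _ in range(n_subsamples)]
    # Step 2: pool every center found across the subsamples.
    pooled = [c for sol in solutions for c in sol]
    # Step 3: cluster the pool once per subsample solution; keep the best fit to the pool.
    candidates = [kmeans(pooled, sol) for sol in solutions]
    return min(candidates, key=lambda cand: distortion(pooled, cand))
```

In this sketch the refined centers would be passed to a full K-means run, for example `kmeans(points, refine(points, init))`. Because each subsample is a small fraction of the data, the refinement step touches far fewer points per pass than clustering the full data set, which is in keeping with the running-time claim above.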
