Abstract:To solve the problem of mixed subjects returned by search engine results, a new subject phrases clustering algorithm is presented to help locate the valuable results that the users really need. The algorithm firstly extractes some subject phrases from the search results. Then, the vector space model is built. Finally, the results are clustered by the improved k-means algorithm. The algorithm was tested and validated by the experiments.