Abstract:Based on the introduction of Open Source Framework of Hadoop, encoded pattern of Map/Reduce and the calculated formula of sentence similarity, the present paper adopts the method of sentence group similarity parallel computing to the encoded pattern of Map/Reduce in terms of Hadoop. And it verifies the stability and feasibility of dealing with tremendous data.