This paper proposes an optimized implementation of the algorithm on the C6000 DSP platform that takes into account the hardware resource characteristics of the DSP and the software pipelining technique. Based on a module-level analysis of the algorithm's processing flow, we design and implement the floating-point to fixed-point conversion. For the compute-intensive modules, we present concrete optimization methods, including improving software pipelining performance, reducing memory accesses, and achieving task-level parallelism. Experimental results obtained in Code Composer Studio show that the optimized implementation improves both efficiency and resource utilization, and that the optimization methods in this paper are also applicable to all C6000 DSPs.
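
As an illustration of what a floating-point to fixed-point conversion typically involves, the following is a minimal sketch in portable C. It assumes a Q15 format (16-bit signed, 15 fractional bits), which is a common choice on fixed-point DSPs but is not stated in the abstract. The names `q15_t`, `float_to_q15`, `q15_to_float`, and `q15_mul` are hypothetical, and no C6000 compiler intrinsics are used.

```c
#include <stdint.h>

/* Assumed format: Q15, i.e. 1 sign bit and 15 fractional bits,
 * representing values in [-1, 1 - 2^-15]. Hypothetical type name. */
typedef int16_t q15_t;

/* Convert a float to Q15 with round-to-nearest and saturation. */
static q15_t float_to_q15(float x)
{
    float scaled = x * 32768.0f;              /* scale by 2^15 */
    if (scaled >= 32767.0f)  return INT16_MAX; /* saturate high */
    if (scaled <= -32768.0f) return INT16_MIN; /* saturate low  */
    /* Round half away from zero before truncating to integer. */
    return (q15_t)(scaled >= 0.0f ? scaled + 0.5f : scaled - 0.5f);
}

/* Convert a Q15 value back to float, e.g. to measure quantization error. */
static float q15_to_float(q15_t x)
{
    return (float)x / 32768.0f;
}

/* Multiply two Q15 values. The 32-bit product is in Q30, so it is
 * rounded and shifted right by 15 bits to return to Q15.
 * Note: right-shifting a negative signed value is implementation-defined
 * in C; this sketch assumes an arithmetic shift, as most compilers provide. */
static q15_t q15_mul(q15_t a, q15_t b)
{
    int32_t p = (int32_t)a * (int32_t)b;      /* Q30 product */
    p = (p + (1 << 14)) >> 15;                /* round, rescale to Q15 */
    if (p > INT16_MAX) p = INT16_MAX;         /* only reached for -1 * -1 */
    return (q15_t)p;
}
```

Comparing `q15_to_float(float_to_q15(x))` against `x` over representative inputs gives a quick estimate of the quantization error that such a conversion introduces.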