本文已被:浏览 2288次 下载 3149次
Received:April 14, 2017 Revised:May 02, 2017
Received:April 14, 2017 Revised:May 02, 2017
中文摘要: 针对现有音乐平台分类类别固定、检索内容限制过多的问题,本文提出了一种基于用户评论的自动化音乐分类方法. 首先,通过linear CRF统计分词模型、n元取词和紧密度分析方法学习得到适合音乐语料分词的字典. 其次,使用linear CRF在上述字典的基础上进行分词,对分词结果进行分合测试,修正分词结果. 然后,使用优化后的TFIDF关键词提取算法进行标签提取,再经过标签合并得到音乐的候选标签. 接着,从全局角度出发对标签进一步筛选,得到音乐的关联标签. 最后,建立音乐和标签之间的概率分类模型,对音乐进行分类. 实验结果表明,该音乐分类方法准确率较高,可以从用户评论中自动地获取音乐多个维度的分类标签,为个性化的音乐检索提供了保障.
Abstract:An automatic music classification method based on users' comments is presented in view of the few categories and limited search content for the existing music platforms. First of all, a linear CRF statistic segmentation model, n-gram word extraction and affinity analysis method are used to obtain a dictionary which can be adapted to music corpus segmentation. Secondly, we use linear CRF model to segment comments with dictionary above, and then we correct the segmentation result via split-merge testing. Thirdly, the optimized TFIDF keyword extraction model is applied to extract candidate tags, and we merge tags after that. Fourthly, candidate tags with fewer frequency are filtered from a global perspective. Finally, a probability classification network is established between the music and filtered tags to classify music. As the result shows, our music classification method achieves high accuracy. Furthermore, it can ensure the personality of music retrieval for generating music tags automatically in multiple dimensions according to users' comments.
keywords: music classification word segmentaion model affinity analysis keyword extraction associated tags
文章编号: 中图分类号: 文献标志码:
基金项目:安徽省自然科学基金(1408085MKL06);高等学校学科创新引智计划项目(B07033)
引用文本:
郝建林,黄章进,顾乃杰.基于用户评论的自动化音乐分类方法.计算机系统应用,2018,27(1):154-161
HAO Jian-Lin,HUANG Zhang-Jin,GU Nai-Jie.Automatic Music Classification Method Based on Users' Comments.COMPUTER SYSTEMS APPLICATIONS,2018,27(1):154-161
郝建林,黄章进,顾乃杰.基于用户评论的自动化音乐分类方法.计算机系统应用,2018,27(1):154-161
HAO Jian-Lin,HUANG Zhang-Jin,GU Nai-Jie.Automatic Music Classification Method Based on Users' Comments.COMPUTER SYSTEMS APPLICATIONS,2018,27(1):154-161