Predicting Network Drama Broadcast Volume Based on Sentiment Analysis and Stacking Model
CSTR:
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [30]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    With the rapid development of network dramas in recent years, the research on broadcast volume has gradually attracted attention. Broadcast volume reflects the reputation and popularity of a network drama, which are closely related to the profits of producers and investors. However, current research rarely considers the impact of the sentiments in viewers’ comments on broadcast volume, and the forecasting models are simple. Consequently, the accuracy of prediction needs to be further improved. After a sentiment analysis of users’ comments, we construct a stacking ensemble learning model to predict the broadcast volume of network dramas in China. Using the SO-PMI (semantic orientation-pointwise mutual information) algorithm, we build a sentiment dictionary in the network drama domain. A basic sentiment dictionary and the number of likes are also taken into account to calculate the comment sentiment scores, which are then added into the prediction index system. With random forest (RF), GBDT (gradient boosting decision tree), XGBoost (extreme gradient boosting), and LightGBM (light gradient boosting machine) as base learners and MLR as a meta learner, a stacking prediction model is constructed to predict the broadcast volume of a network drama in stages. The broadcast volume of the next week can be forecasted with data of the current week. Finally, the results of different models are compared and analyzed, and the importance scores of predictive variables are obtained. The experimental results show that the determination coefficient R-square of the proposed model reaches 0.89, which is higher than that of a single base learner (maximum 0.84) as well as that of the stacking model without sentiment score variables (0.81). It can be concluded that with sentiment score variables, the proposed stacking ensemble learning model delivers better prediction accuracy on the broadcast volume of network dramas than that of traditional models.

    Reference
    [1] 王康. 中国网络剧发展的动力机制. 青年记者, 2019, (16): 55–57
    [2] 吕赜如. 浅谈网络剧相对于电视剧的创新和发展. 科技传播, 2019, 11(11): 105–107. [doi: 10.3969/j.issn.1674-6708.2019.11.069
    [3] 朱恺勐. 媒介融合视角下网络剧如何发展. 戏剧之家, 2019, (8): 118. [doi: 10.3969/j.issn.1007-0125.2019.08.086
    [4] 王大员, 徐敏. 低成本网络剧的营销策略分析——基于网络剧播放量的实证研究. 新闻世界, 2019, (2): 60–64
    [5] 肖兴辉, 李波. 营销节奏对网络剧播放量的影响因素探析. 现代传播(中国传媒大学学报), 2019, 41(7): 133–136
    [6] 李聪, 余梦, 汪之舟, 等. 基于弹幕大数据的网络视频精准营销模式研究. 市场周刊, 2020, (6): 90–91
    [7] Litman BR. Predicting success of theatrical movies: An empirical study. Journal of Popular Culture, 1983, 16(4): 159–175. [doi: 10.1111/j.0022-3840.1983.1604_159.x
    [8] Chance DM, Hillebrand E, Hilliard JE. Pricing an option on revenue from an innovation: An application to movie box office revenue. Management Science, 2008, 54(5): 1015–1028. [doi: 10.1287/mnsc.1070.0826
    [9] Feng N, Feng HH, Li DH, et al. Online media coverage, consumer engagement and movie sales: A PVAR approach. Decision Support Systems, 2020, 131: 113267. [doi: 10.1016/j.dss.2020.113267
    [10] Quader N, Gani MO, Chaki D, et al. A machine learning approach to predict movie Box-Office success. 2017 20th International Conference of Computer and Information Technology (ICCIT). Dhaka: IEEE, 2017. 1–7.
    [11] Zhang L, Luo JH, Yang SY. Forecasting box office revenue of movies with BP neural network. Expert Systems with Applications, 2009, 36(3): 6580–6587. [doi: 10.1016/j.eswa.2008.07.064
    [12] 米传民, 鲁月, 林清同. 基于加权K-Means和局部BPNN的票房预测模型. 计算机系统应用, 2019, 28(2): 15–23. [doi: 10.15888/j.cnki.csa.006709
    [13] Wu SY, Zheng YF, Lai ZK, et al. Movie box office prediction based on ensemble learning. 2019 IEEE Symposium on Product Compliance Engineering—Asia (ISPCE-CN). Hong Kong: IEEE, 2019. 1–4.
    [14] Liao Y, Peng YX, Shi SL, et al. Early box office prediction in China’s film market based on a stacking fusion model. Annals of Operations Research, 2020: 1–18
    [15] 李旺泽. 基于监督学习的国产电影票房影响因素研究[硕士学位论文]. 武汉: 湖北工业大学, 2020.
    [16] 杨曼. 基于XGBoost算法的电影票房预测[硕士学位论文]. 兰州: 兰州大学, 2020.
    [17] Apala KR, Jose M, Motnam S, et al. Prediction of movies box office performance using social media. 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013). Niagara Falls: IEEE, 2013. 1209–1214.
    [18] 王晓耘, 袁媛, 史玲玲. 基于微博的电影首映周票房预测建模. 现代图书情报技术, 2016, (4): 31–39
    [19] 朱梦娴. 口碑与票房: 社会化媒体电影评论的商业价值研究. 信息资源管理学报, 2015, 5(4): 57–64, 80
    [20] Ahmad IS, Bakar AA, Yaakub MR, et al. Sequel movie revenue prediction model based on sentiment analysis. Data Technologies and Applications, 2020, 54(5): 665–683. [doi: 10.1108/DTA-10-2019-0180
    [21] Hur M, Kang P, Cho S. Box-office forecasting based on sentiments of movie reviews and Independent subspace method. Information Sciences, 2016, 372: 608–624. [doi: 10.1016/j.ins.2016.08.027
    [22] Gaikar DD, Marakarkandy B, Dasgupta C. Using Twitter data to predict the performance of Bollywood movies. Industrial Management & Data Systems, 2015, 115(9): 1604–1621
    [23] Qiu XY, Tang TY. Microblog mood predicts the box office performance. Proceedings of the 2018 Artificial Intelligence and Cloud Computing Conference. Tokyo: ACM, 2018. 129–133.
    [24] Ding C, Cheng HK, Duan Y, et al. The power of the “like” button: The impact of social media on box office. Decision Support Systems, 2017, 94: 77–84. [doi: 10.1016/j.dss.2016.11.002
    [25] Zhao M, Zhang TZ, Chai JP. Based on SO-PMI algorithm to discriminate sentimental words’ polarity in TV programs’ subjective evaluation. 2015 8th International Symposium on Computational Intelligence and Design (ISCID). Hangzhou: IEEE, 2015. 38–40.
    [26] Kim J, Moon N. Rating and comments mining using TF-IDF and SO-PMI for improved priority ratings. KSII Transactions on Internet and Information Systems, 2019, 13(11): 5321–5334
    [27] 王勇, 吕学强, 姬连春, 等. 基于极性词典的中文微博客情感分类. 计算机应用与软件, 2014, 31(1): 34–37, 126. [doi: 10.3969/j.issn.1000-386x.2014.01.010
    [28] 刘彦虬. 基于HowNet和用户点赞的中文电影评论情感分析研究[硕士学位论文]. 呼和浩特: 内蒙古大学, 2019.
    [29] 张公让, 鲍超, 王晓玉, 等. 基于评论数据的文本语义挖掘与情感分析. 情报科学, 2021, 39(5): 53–61
    [30] 朱国森, 郑晓亮. 基于stacking集成模型的网络流量预测研究. 重庆工商大学学报(自然科学版), 2021, 38(2): 16–22
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

李明珠,米传民,肖琳,许乃元.加入情感分析的Stacking模型在网络剧播放量预测中的应用.计算机系统应用,2022,31(6):315-323

Copy
Share
Article Metrics
  • Abstract:655
  • PDF: 1667
  • HTML: 1112
  • Cited by: 0
History
  • Received:August 28,2021
  • Revised:September 26,2021
  • Online: February 21,2022
Article QR Code
You are the first990366Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063