Predicting Network Drama Broadcast Volume Based on Sentiment Analysis and Stacking Model
CSTR:
Author:
Affiliation:

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    With the rapid development of network dramas in recent years, the research on broadcast volume has gradually attracted attention. Broadcast volume reflects the reputation and popularity of a network drama, which are closely related to the profits of producers and investors. However, current research rarely considers the impact of the sentiments in viewers’ comments on broadcast volume, and the forecasting models are simple. Consequently, the accuracy of prediction needs to be further improved. After a sentiment analysis of users’ comments, we construct a stacking ensemble learning model to predict the broadcast volume of network dramas in China. Using the SO-PMI (semantic orientation-pointwise mutual information) algorithm, we build a sentiment dictionary in the network drama domain. A basic sentiment dictionary and the number of likes are also taken into account to calculate the comment sentiment scores, which are then added into the prediction index system. With random forest (RF), GBDT (gradient boosting decision tree), XGBoost (extreme gradient boosting), and LightGBM (light gradient boosting machine) as base learners and MLR as a meta learner, a stacking prediction model is constructed to predict the broadcast volume of a network drama in stages. The broadcast volume of the next week can be forecasted with data of the current week. Finally, the results of different models are compared and analyzed, and the importance scores of predictive variables are obtained. The experimental results show that the determination coefficient R-square of the proposed model reaches 0.89, which is higher than that of a single base learner (maximum 0.84) as well as that of the stacking model without sentiment score variables (0.81). It can be concluded that with sentiment score variables, the proposed stacking ensemble learning model delivers better prediction accuracy on the broadcast volume of network dramas than that of traditional models.

    Reference
    Related
    Cited by
Get Citation

李明珠,米传民,肖琳,许乃元.加入情感分析的Stacking模型在网络剧播放量预测中的应用.计算机系统应用,2022,31(6):315-323

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:August 28,2021
  • Revised:September 26,2021
  • Adopted:
  • Online: February 21,2022
  • Published:
Article QR Code
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063