2020美赛特等奖C题论文.pdf

发布时间：2022-06-05 发布人：admin 分类：说明书资料大小：13.19M 资料格式：pdf 举报版权申诉

qq_41809136-12707050-2020美赛特等奖C题6篇论文.pdf-第1页.png

第1页 / 共210页

qq_41809136-12707050-2020美赛特等奖C题6篇论文.pdf-第2页.png

第2页 / 共210页

qq_41809136-12707050-2020美赛特等奖C题6篇论文.pdf-第3页.png

第3页 / 共210页

qq_41809136-12707050-2020美赛特等奖C题6篇论文.pdf-第4页.png

第4页 / 共210页

qq_41809136-12707050-2020美赛特等奖C题6篇论文.pdf-第5页.png

第5页 / 共210页

qq_41809136-12707050-2020美赛特等奖C题6篇论文.pdf-第6页.png

第6页 / 共210页

qq_41809136-12707050-2020美赛特等奖C题6篇论文.pdf-第7页.png

第7页 / 共210页

qq_41809136-12707050-2020美赛特等奖C题6篇论文.pdf-第8页.png

第8页 / 共210页

2002116.pdf‎

Introduction

Assumptions and Notations

Assumptions

Notations

Informative Evaluation Model

Vector Encoding Forms of Star Ratings

Contextual Entropy VADER Hybrid Model for Text-Based Measures

Manually Annotating the Seed Word

Contextual Entropy Block (CE)

VADER Block

Proposed CE-VADER for Sentiment Analysis

Combination of Text-Based and Rating-Based Measures

Model Implementation, Sensitivity Analysis and Results

Difference Equation to Measure Time-Based Pattern

Difference Equation Based Model

Model Implementation, Sensitivity Analysis and Results

Predict Potential Success or Failure

Time Series Forecasting for Predicting Future Reputation

Evaluating the Success or Failure potential

Model Implementation and Results

Specific Ratings and Descriptors Analysis

Specific Star Ratings Relevance to Rating Frequency

Specific Quality Descriptors' Relevance to Rating Levels

Naive Bayesian Model for Evaluation

Model Implementation and Results

Attractiveness Analysis of Design Features

Sales Strategies and Recommendations

Strengths and Weaknesses

Strengths

Weaknesses

Conclusion

A Letter to the Marketing Director of Sunshine Company

Appendices

Appendix Annotated Seed Words and Frequency

Appendix The Number of Keyword Occurrences in Different Keyword Groups

Appendix Top 1% Most Informative Ratings and Reviews

Appendix Source Code for VADER Sentiment Analysis

Appendix Source Code for Informative Algorithm

Appendix Source Code for Reputation Calculation

Appendix Source Code for Beyes Model

Appendix Source Code for Time Series Prediction

Appendix Source Code for Wordcloud Picture

2003717.pdf‎

Introduction

Problem Background

Clarification and Restatement

Our Work

Problem 1: Data Preprocessing and Mining

Data Cleaning

Text Mining by LDA Topic ModelLDA

Overall Data Characteristics

Problem 2 (a): Ratings and Reviews Based Data Measures

Weighted Rating Ratio

Avg and Std of Weighted Sentimental Scores for Reviewsbhatt2015amazonhaque2018sentimentkleinbaum2002logistic

Users' Preference Vector

Problem 2 (b): Reputation Metric

Reputation Metric

Analysis on Reputation Variation Patterns

Problem 2 (c): Nested Two-Layer LSTM

The Structure of the Nested Two-layer LSTM Model

Analysis on Potential Success of Products

Problem 2 (d): Causal Effectiveness Between Reveiws

Ripple Effects of Extreme Ratings

Causal Inference for Ratio of Low Rating and Review Length

Problem 2 (e)dhanasobhon2007analysis: Correlation between Affective Words and Star Ratings

Analysis for Alignment of Rate and Review Scored by Certain Affective Words

Micro Observation of Asymmetricity between Rate and Reviewchen2008all

Strengths and Weaknesses

Strengths

Weaknesses

Conclusion

The Letter to the Marketing Director of Sunshine Company

Appendices

Appendix LDA Topic Model for Microwave and Pacifier

Appendix Product Contrast of Microwave and Pacifier Over Time

Appendix Code

Data Preprocess and Overall Analysis

LDA Analysis

Sentimental Score

LSTM

Reputation Model

2004647.pdf‎

Introduction

Problem Restatement

Literature Review

Data Cleaning

Modeling Framework

Assumptions & Nomenclature

the RRBS Model

Model Overview

the Rating Vector

the Review Vector

The Review-based Measure Equation

the Sentiment Polarity Matrix

the Reputation Model

Time Weight Sequence

Quantification of Reputation

Model Evaluation

Trend Similarity between Quantified Reputation and Sales

Kendall's Tau Method

the Successfulness Prediction Model

Gaussian Process Regression

Reputation Mapping

Successfulness Threshold

Results

Evaluation of Star Rating's Incitement

Association between Review Wording and Rating Levels

Our Strategies

Marketing Strategies

Important Design Features

Sensitivity Analysis

Strengths and Weaknesses

Strengths

Weaknesses

Conclusion

Our Letter

Appendices

2007707.pdf‎

2009116.pdf‎

2010638.pdf‎

Problem Chosen C 2020 MCM/ICM Summary Sheet Team Control Number 2002116 Riddle of Sphinx: Cracking the Secret of Amazon’s Ratings and Reviews Summary We have witnessed the rise of mass online marketplaces. For example Amazon, one of the biggest online platforms, is worth around $ 915 billion. Guided by the customer obsession principle, it provides an opportunity for the customers to rate the products from 1 to 5. More- over, buyers can submit a text-based message, namely review, to express their feeling towards the products. The massive data of those ratings and reviews offer a wealth of information re- mained to be mined. Analysis of text-based messages or rating-based values has received wide attention, yet there is not a method severs as the combination of both, especially for the case of an online marketplace. To address the above-mentioned challenge, we propose a novel CE-VADER hybrid model for sentiment analysis in reviews, classifying messages into ﬁve groups of strong positive, weak positive, moderate, weak negative and strong negative. Empirical results indicate that the pro- posed ﬁve-group classiﬁcation model correlates to the ﬁve-star rating system well. Then a state-of-art informative evaluation model is proposed as the combination of the text-based and rating-based measures. We pick out 1% most informative reviews and ratings of each product to evaluate the properties and propose sales strategies. We propose the “reputation” rate based on the differential equation model in the literature to evaluate the reputation of the product. Then we employ an Auto Regression (AR) model as the time series forecasting method to predict future “reputation” rate and the potential success or the failure of each product. AR model shows high accuracy on the validation set with a maximum Root Mean Square Error (RMSE) of 0.131. Paciﬁers have a good reputation and pre- dicted to be successful while microwaves and hair dryers have bad reputations and predicted to fail. The results show relevance with the proportions of the continuous ﬁve-star or one-star rating sequence. Lastly, we analyze speciﬁc words and descriptors to ﬁnd their correlation to the ratings. According to our empirical results, we propose some conﬁdent sales strategies and recom- mendations for the online marketplace, e.g., the timing choice of introducing products into market, targeted adjustment according to star ratings, etc. We write a letter to the marketing director of Sunshine Company to summarize our analysis and results, together with our rec- ommendations. Our framework shows a strong accuracy, robustness. It can be easily implemented to other data with our source codes. Keywords: Text-Based Measure, Informative Text Selection, Reputation Quantiﬁcation, Sales Strategy Formation.

Team # 2002116 Page 1 of 38 Riddle of Sphinx: Cracking the Secret of Amazon’s Ratings and Reviews March 9, 2020 Contents 1 Introduction 2 Assumptions and Notations . . 2.1 Assumptions . 2.2 Notations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Informative Evaluation Model 3.1 Vector Encoding Forms of Star Ratings . . . . . . . . . . . . . . . . . . . . . . . . 3.2 Contextual Entropy VADER Hybrid Model for Text-Based Measures . . . . . . 3.2.1 Manually Annotating the Seed Word . . . . . . . . . . . . . . . . . . . . 3.2.2 Contextual Entropy Block (CE) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2.3 VADER Block . . . . . . . . . . . . . . . . 3.2.4 Proposed CE-VADER for Sentiment Analysis . 3.3 Combination of Text-Based and Rating-Based Measures . . . . . . . . . . . . . 3.4 Model Implementation, Sensitivity Analysis and Results . . . . . . . . . . . . . . . 4 Difference Equation to Measure Time-Based Pattern 4.1 Difference Equation Based Model . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.2 Model Implementation, Sensitivity Analysis and Results . . . . . . . . . . . . . 5 Predict Potential Success or Failure . 5.1 Time Series Forecasting for Predicting Future Reputation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.2 Evaluating the Success or Failure potential 5.3 Model Implementation and Results . . . . . . . . . . . . . . . . . . . . . . . . . . 6 Speciﬁc Ratings and Descriptors Analysis 6.1 6.2 . Speciﬁc Star Ratings Relevance to Rating Frequency . . . . . . . . . . . . . . . Speciﬁc Quality Descriptors’ Relevance to Rating Levels . . . . . . . . . . . . . 6.2.1 Naive Bayesian Model for Evaluation . . . . . . . . . . . . . . . . . . . . 3 4 4 4 4 5 5 7 7 9 9 10 11 11 11 12 14 14 14 14 15 16 18 18 . . . . . . . . . . . . . . . . . .

Team # 2002116 Page 2 of 38 6.2.2 Model Implementation and Results . . . . . . . . . . . . . . . . . . . . . . 7 Attractiveness Analysis of Design Features 8 Sales Strategies and Recommendations 9 Strengths and Weaknesses . . Strengths . . 9.1 9.2 Weaknesses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Conclusion 11 A Letter to the Marketing Director of Sunshine Company Appendices Appendix A Annotated Seed Words and Frequency Appendix B The Number of Keyword Occurrences in Different Keyword Groups Appendix C Top 1% Most Informative Ratings and Reviews Appendix D Source Code for VADER Sentiment Analysis Appendix E Source Code for Informative Algorithm Appendix F Source Code for Reputation Calculation Appendix G Source Code for Beyes Model Appendix H Source Code for Time Series Prediction Appendix I Source Code for Wordcloud Picture 19 20 20 21 21 22 22 22 24 24 25 26 36 36 37 37 38 38

Team # 2002116 1 Introduction Page 3 of 38 Our society has witnessed the rise of many online marketplaces, with a total worldwide market value of 4.3 trillion dollars [1]. One salient feature of the online marketplace compared with traditional platforms is the massive review of texts and ratings. Among all of them, Ama- zon has received the most attention, as its greatest success [1]. Amazon also provides customers with chances to freely express their feeling and rate the products that they have purchased. Previous work [2] indicates that customers will largely refer to the reviews and ratings be- fore they buy the product on the platforms. Platforms can adjust their sales strategy by checking these comments. Hence, the ratings and the reviews both provide references to other potential buyers and massive data to analyze the demand of the customers, which can help to develop adaptive strategies. By making full use of these data, we can achieve a win-win situation for both the buyers and the platform. One of the biggest challenges is the complexity and diversity of the texts of the reviews [3, 4]. In this paper, we propose a novel sentiment analysis model as the text-based measure to address this issue. In this paper, we develop a series of models as the combination of text-based, rating- based, and time-based measures to pick out the most informative ratings and reviews to track. We also construct a novel evaluation framework to quantify the reputation of each product and predict potential success or failure. Then, we analyze the correlation between continuous same star ratings, word descriptors and the reputation of the products. We implement our model on the real data set generated from three different types of products, namely the paciﬁer, microwave, and the hair dryer. Researchers have pointed out the necessity to study when and how the online platforms should adjust their marketing communication strategy in response to consumer reviews or rat- ings [5]. We propose several sales strategies and recommendations in this paper based on our analysis and results. The rest of the paper is organized as follows. In section 2, we list the main assumptions in model construction and introduce the notations which will be frequently used in this paper. In section 3, a novel Information Evaluation Model is proposed. It is made up of a hybrid the state-of-art CE [6] and VADER [7] for sentiment analysis in the review text. Then we propose the "importance" rate as a combination of text-based measure (i.e., our proposed CE-VADER model) and ratings-based measure (i.e., the star-rating and the helpful votes) to indicate how informative the review and the rating are. To the best of our knowledge, we are the ﬁrst to propose a review-text-based sentiment analysis model. In section 4, we employ a difference equation model as the backbone to measure the time pattern of each product. Moreover, the "reputation" rate is proposed in this section to measure the growth or the decline of the repu- tation. In section 5, we employ an Auto Regression model (AR) to predict the change of rep- utation in the future time domain and propose a fuzzy system to predict the potential success or failure of each product. More details about the results of our model implemented on given data can be found in section 6,7,8. The strengths and weaknesses of the proposed model and framework are discussed in section 9. We conclude in section 10. All source codes are attached to the Appendix D-I and can be easily implemented to other data sets.

Team # 2002116 Page 4 of 38 2 Assumptions and Notations 2.1 Assumptions To simplify our model and eliminate the complexity, we make the following main assump- tions in this literature. All assumptions will be re-emphasized once they are used in the con- struction of our model. Assumption 1. The online marketplace operates stably. And there were no situations such as an out- break of an epidemic which would seriously affect the production chain of online shopping. Assumption 2. The ratings and reviews depict customers’ real experience and feeling about their pur- chased products. The sentiment in the review text reﬂects one’s feelings on the products. Assumption 3. The vast majority of individual differences of customers e.g., economic status and edu- cational level, are ignored. Assumption 4. It takes some time for shipping the product. Some customers would prefer making reviews sometime after receiving the purchased products. Assumption 5. Consumers pay more attention to the negative comments e.g., low-star rating or nega- tive reviews when purchasing the products. 2.2 Notations In this work, we use the nomenclature in Table 1 in the model construction. Other none- frequent-used symbols will be introduced once they are used. Table 1: Notations used in this literature Symbol id sid hvid Rid rdid VEC INT IMP REP Deﬁnition review id Type String Scalar Scalar String Date Star rate, subscript is its associated review id Helpful votes, subscript is its associated review id Review text, subscript is its associated review id Review date, subscript is its associated review id Vector encoding of the star rating Mapping Vector encoding of intensity relevant to 5-class seed words Mapping Mapping Mapping Importance rate of review and associated rating Reputation rate of product at some time 3 Informative Evaluation Model In this section, we proposed the "importance" to evaluate how informative the review text and star rates are. The most informative factor we take into account is the sentiment of the review text. In this literature, we propose a CE-VADER model to address the sentiment analysis issue in the review text. Our model will classify the text into ﬁve groups: strong positive, weak positive, moderate, weak negative and strong negative in the consistency of the ﬁve-star rating scheme. Then our proposed "importance" will incorporate the text-based measure, star rating

Team # 2002116 Page 5 of 38 with their ﬁdelity, correlation. The higher the importance, the more informative it is. The rest of the section is arranged as follows. In section 3.1, we covert the integral star rate to vector form. In section 3.2, we propose the CE-VADER, a hybrid model for the text-based measures. In section 3.3, we introduce the "importance" to calculate how informative the review and the star rating together are. In section 3.4, we implemented our model on real data set of 3 types of products to indicate 1% most informative review and star ratings, and analyze the model sensitivity. 3.1 Vector Encoding Forms of Star Ratings Consumers can freely express their comments on the products on Amazon by rating one to ﬁve stars after purchasing. A one-star rating is associated with the least satisfaction while ﬁve-star with the highest satisfaction. The one-to-ﬁve star rating itself is a sufﬁcient measure. To combine the ratings-based measure with the text-based measure which we will discuss in the next section. We would like to convert the star-rating to vector forms in this section. Firstly, we calculate the ration of each star rate of hair dryers, baby paciﬁers, and mi- crowaves from the given data respectively, as shown in Figure 1. We observe that baby paciﬁers have received the highest percentage of high star rating, while microwaves a lower star rating. Products with high technology content also face more quality problems, which is in line with actual expectations, indicating that star ratings can indeed reﬂect consumer satisfaction. However, we would like to convert the rating to an equivalent 5−dimension vector en- coding forms. Denote the star-rate as s ∈ {1, 2, 3, 4, 5}, the vector encoding forms of s can be formulated by VEC(s) = (vec1 s)T ∈ R5 where the components deﬁned by: s,··· , vec5 veci s = |i−s|2 e 2σ05 j=1 e |j−s|2 2σ0 (1) where σ0 is a tunable parameter, determining the robustness of our model, the bigger the more robust. The mapping VEC is one-to-one, hence we claim the converted form is equivalent to the star-rating. Moreover, by our deﬁnition, we can ﬁnd: i) s = argmaxi{veci s = 1. VEC(s) encoding as a probabilistic vector with each component represents their possibility to be rated by the associated star e.g., the 4-star rate has the highest probability to be rated 4 stars, second-highest possibility to be rated as 3 or 5 stars. i=1 veci s}, ii)5 3.2 Contextual Entropy VADER Hybrid Model for Text-Based Measures In this literature, we construct a novel model for sentiment analysis based on the review text. To the sake of simplicity, the sentiment scored by the text is regarded as the only fact to measure the success or the failure of the product e.g., the positive attitudes usually indicate a higher potential of the product success while on the contrary, negative attitudes indicate a higher possibility for the product failure. In this section, we propose a contextual entropy and VADER [7] hybrid model, namely the CE-VADER to address the sentiment analysis challenge in the review text. The model is made up of the two blocks: the contextual entropy (CE) block and the VADER block. CE model shows its high capability in sentiment analysis of the stock market news [6] but its limitation in short

Team # 2002116 Page 6 of 38 Figure 1: Star rating distribution of paciﬁer, microwave and hair dryer based on the given data. context e.g., the review in this literature. While the VADER model outperforms the state-of-art nature language model in short online texture but their accuracy depends a lot on the pre-listed lexicon words. The two blocks of CE-VADER will separative outcome a 5−dimension prob- abilistic vector with each component represents the probability of being its associated group. After a voting block, our proposed model will classify the review into one of the ﬁve groups together with an intensity showing how intense it is to be classiﬁed into the group. By hybridiz- ing both models, we show CE-VADER can classify the review context into ﬁve groups in the consistency of the star rating. The rest of this section is arranged as follows. Firstly we show our strategy to generate the seed words for the CE model and expanding the gold-standard list of the VADER block. Then we introduce the CE and VADER model superlatively. Finial we propose the hybrid CE- VADER model. Our model will classify the review context into ﬁve classes: strong positive, weak positive, moderate, weak negative and strong negative, with its intensity. In the next section, we will propose a review ﬁdelity based on the classiﬁcation results and the intensity. Seed word candidate Annotated Seed word generate Seed words for CE Extend gold-standard list Strength of co-occurrence analysis Contextual distribution analysis Sentiment analysis by VADER Aggregation Sentiment class and intensity Figure 2: The overall architecture of our proposed CE-VADER model. The model is made up of two blocks, namely the CE block and the VADER block. 0%10%20%30%40%50%60%70%80%90%100%hair dryermicrowavepacifier⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐1266027161426300Total number189391615114709451192667134112402670420969996391032

Team # 2002116 Page 7 of 38 3.2.1 Manually Annotating the Seed Word We put 80% of the data as the training set and all the rest 20% as the testing set of eval- uations. Sentences in the review body from the training set are broken down into separated words, among which are statistically calculated their frequency. The high-frequency emotion words are picked out as seed words and manually annotated by us, while the low-frequency ones are discarded. The annotator (one of our group members) will incorporate his expertise natural-language processing knowledge for the classifying all the selected emotion words into ﬁve groups i.e., strong positive, weak positive, moderate, weak negative and strong negative. Instead of coarse two-group annotations of either "positive" or "negative" [6], we detailedly sub-classify each one into "strong" and "weak" subgroups and set aside one more group labeled as "moderate". The ﬁve-group annotation strategies aim to correlate with the one-to-ﬁve-star- rating score e.g., "weak positive" maps to the four-star-rating. We denote the ﬁve groups of the seed word as Gi, with i = 1, 2, 3, 4, 5. Representative words generated from the training set with "positive" or "negative" labels are depicted in Figure 3 with the cold tone or warm tone respectively. Annotated ﬁve-class seed words are attached to Appendix A. Figure 3: Demonstration of some representative seed words. Words annotated as "positive" are colored in light tone while the "negative" ones in a dark tone. The bigger the size, the higher word frequency. 3.2.2 Contextual Entropy Block (CE) The contextual entropy block employs a part-contextual entropy model [6] as the back- bone architecture. A part-contextual entropy model can consider both the strength of the co- occurrence and the contextual distribution between the candidate of the most representative words from the review context and the generated seed words. We employ a vector to encode the strength between word and its context in the review. To be more speciﬁc, denote the left and the right context of the kth word wk in the n-word review context R = w1w2 ··· wkwk+1 ··· wn are {w1w2 ··· wk−1} and {wk+1wk+2 ··· wn} respectively. Note that we set review as a complete target instead of breaking it into sentences as in the ref.[6], in consideration of the short contextual style of online comments. The dimension of the vector depends on the length of the review i.e.,n. Denote the vector to record the left context of word wk as vlef t(wk) = (vlef t wkN ) and the vector to record the right context as vright(wk) = wk2,··· , vlef t wk1, vlef t

分享到：

赞收藏

资料库

2020美赛特等奖C题论文.pdf

相关推荐

考试认证

热门标签

最新资料