Fake Review Detection in 2024: How it works & 3 Case Studies
Figure 1. Popularity of the keyword “fake review detection” on Google Search between 2019 and 2024.
While shopping online, ~80% of customers read online reviews or comments on products and services before making a purchase. However, some of these reviews are fraudulent: they promote or disparage certain products and, as a result, mislead buyers. Indeed, 2,700,000 fake reviews were detected in 2021, which makes up ~50% of consumer reviews with five-star ratings. 1
This article explains how fake reviews are created, the AI and machine learning methods used to detect deceptive consumer reviews, and real-life applications for identifying them.
How are fake reviews generated?
Source: ReviewTrackers
Figure 2. Comparison of a fake review with a review by a real user
Fake reviews are mainly written in two ways: human-generated and machine-generated.
Human-generated fake reviews
Content creators are paid to write fake online reviews that promote or disparage certain products. In general, the following patterns exist:
- The owner of the products can pay content creators to write feedback to obtain higher ratings or impress potential customers.
- Competitors may hire spammers to disparage other brands' products and direct customers to alternatives, in this case their own products.
Machine-generated fake reviews
Writing fake reviews manually is time-consuming, labor-intensive, and costly. Therefore, automated techniques (e.g., natural language processing (NLP) and machine learning (ML) methods) are used to create them. Unlike human-generated reviews, machine-generated reviews are produced through text generation, which can produce reviews at scale.
With the advance of generative AI tools such as ChatGPT, companies can also generate fake reviews by writing suitable prompts (see Figure 3). Unfortunately, this makes fake reviews harder to detect, because the generated text resembles sentences written by a real person.
Figure 3. Example of fake reviews generated by ChatGPT-4.
Fake review detection methods
Manual detection
Manual detection is the most basic way of detecting fake reviews: annotators decide by hand whether a review is fake. Although it may seem a promising approach, research shows that humans reach only 57% accuracy in a fake review detection task.2 Besides, given the exponential increase in online reviews, it requires a large workforce and a great deal of time.
Algorithm-based detection
The number of online reviews on TripAdvisor increased from 200 million in 2014 to 1 billion in 2021.3 As customers’ reviews increase exponentially, so do fake reviews. Machine learning techniques provide a way to detect online spam. ML algorithms analyze reviews based on two kinds of features:
- Textual (e.g., nouns, phrases, punctuation, linguistic style) features
- Behavioral (e.g., number of reviews, review dates, user profile) features
Then, algorithms make classifications based on these features. Recent research applying the k-means algorithm, an ML method, achieved 96% accuracy in detecting fake reviews.4
Algorithms can be trained to detect fake reviews through textual features such as
- Excessive punctuation use
- Poor grammar
- An overly negative or positive tone
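As a rough illustration of how such textual cues could be turned into numbers for a classifier, here is a minimal sketch. The function name `textual_features` and the three features it returns are assumptions made for this example, not the features used in the cited research. It covers only punctuation and capitalization; grammar quality and tone need separate tooling.

```python
import re

def textual_features(review: str) -> dict:
    """Compute a few simple, hypothetical textual cues for one review."""
    words = re.findall(r"[A-Za-z']+", review)
    n_words = max(len(words), 1)  # avoid division by zero on empty text
    # Runs of two or more '!' or '?' characters, e.g. "!!!" or "?!"
    punct_runs = re.findall(r"[!?]{2,}", review)
    # Words of 3+ characters written entirely in capitals, e.g. "BEST"
    shouted = [w for w in words if len(w) >= 3 and w.isupper()]
    return {
        "exclamations_per_word": review.count("!") / n_words,
        "punct_runs": len(punct_runs),
        "shouted_ratio": len(shouted) / n_words,
    }

sample = "BEST product EVER!!! Buy it now!!"
features = textual_features(sample)
print(features)
```

Each review becomes a small numeric vector that a model can consume. Which cues are actually informative depends on the platform and has to be checked against labeled data.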
You can also use generative AI tools to help detect fake reviews. To do this effectively, first provide the model with examples of fake reviews, along with explanations of the cues that suggest each one is fake. Then present a set of reviews and ask the model to identify which ones might be fake.
Researchers use sentiment analysis methods to identify fake reviews based on textual features. Sentiment analysis classifies the opinions or feelings expressed in a text as positive, negative, or neutral.
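The two steps described above (labeled examples with cues first, then the reviews to check) can be sketched as a small prompt builder. The function `build_prompt`, the wording of the prompt, and the sample reviews are all hypothetical. Sending the prompt to a particular AI tool is deliberately left out, since that depends on the tool you use.

```python
def build_prompt(labeled_examples, candidates):
    """Assemble a few-shot prompt from (review, label, cue) triples
    followed by the unlabeled reviews to classify."""
    lines = [
        "You are checking customer reviews for signs of being fake.",
        "Labeled examples:",
    ]
    for text, label, cue in labeled_examples:
        lines.append(f'- Review: "{text}" | Label: {label} | Cue: {cue}')
    lines.append("Classify each review below as FAKE or GENUINE and name the cue:")
    for i, text in enumerate(candidates, start=1):
        lines.append(f'{i}. "{text}"')
    return "\n".join(lines)

# Hypothetical examples written for this sketch
examples = [
    ("BEST PRODUCT EVER!!! Buy now!!!", "FAKE", "excessive punctuation, no specifics"),
    ("Battery lasts about two days with normal use.", "GENUINE", "concrete detail"),
]
candidates = [
    "Amazing amazing amazing, changed my life!!!",
    "Arrived late, but works as described.",
]
prompt = build_prompt(examples, candidates)
print(prompt)
```

Treat the model's answers as hints rather than verdicts, and spot-check them against reviews whose status you already know.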
Algorithms can also monitor the behavioral pattern of reviewers, such as the user’s total number of reviews, review dates, and user profile details. These metrics allow ML models to classify suspicious reviews and help determine fake review characteristics.
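A minimal sketch of how such behavioral metrics might be derived from raw review records is shown below. The input format (pairs of user ID and review date) and the feature names are assumptions for illustration. Real platforms have richer profile data than this.

```python
from collections import Counter, defaultdict
from datetime import date

def behavioral_features(reviews):
    """reviews: iterable of (user_id, review_date) pairs.
    Returns per-user counts that a classifier could use as inputs."""
    dates_by_user = defaultdict(list)
    for user_id, day in reviews:
        dates_by_user[user_id].append(day)

    features = {}
    for user_id, days in dates_by_user.items():
        per_day = Counter(days)  # how many reviews on each calendar day
        features[user_id] = {
            "total_reviews": len(days),
            "max_reviews_in_one_day": max(per_day.values()),
            "active_days": len(per_day),
        }
    return features

# Hypothetical records: u1 posts three reviews on the same day
records = [
    ("u1", date(2024, 1, 5)),
    ("u1", date(2024, 1, 5)),
    ("u1", date(2024, 1, 5)),
    ("u2", date(2024, 1, 5)),
    ("u2", date(2024, 3, 9)),
]
per_user = behavioral_features(records)
print(per_user)
```

A burst of reviews on a single day is only a signal, not proof. In practice it would be combined with textual features before any classification.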
Case studies of fake review detection
1- Sentiment analysis on Amazon reviews
Source: CSI Transactions on ICT
Figure 4. Flow diagram of the study on detecting fake reviews through sentiment analysis
Researchers collected ~40,000 reviews through web scrapers from the Amazon website and conducted sentiment analysis, classifying texts based on their sentiment score as positive, negative, or neutral. Then they determined a sentiment threshold to detect suspicious reviews and applied Random Forest classification based on the scores obtained. Their results showed 91% accuracy in detecting fake reviews.5
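To make the idea of a sentiment threshold concrete, here is a simplified sketch. It does not reproduce the study: the tiny word lists, the scoring formula, and the threshold of 0.3 are all assumptions, and the Random Forest step is omitted. It only shows how a score can be mapped to a polarity label and how extreme scores can be flagged for closer inspection.

```python
import re

# Hypothetical mini-lexicons; real sentiment tools use far larger resources
POSITIVE_WORDS = {"great", "amazing", "perfect", "best", "love", "excellent"}
NEGATIVE_WORDS = {"bad", "terrible", "worst", "awful", "broken", "hate"}

def sentiment_score(text):
    """(positive hits - negative hits) / word count, a value in [-1, 1]."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    positive = sum(word in POSITIVE_WORDS for word in words)
    negative = sum(word in NEGATIVE_WORDS for word in words)
    return (positive - negative) / len(words)

def sentiment_label(score):
    """Map a score to positive, negative, or neutral."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

def is_suspicious(text, threshold=0.3):
    """Flag reviews whose sentiment is extreme in either direction."""
    return abs(sentiment_score(text)) >= threshold

gushing = "Amazing, perfect, the best product, I love it"
balanced = "The battery is great but the case feels cheap"
print(sentiment_label(sentiment_score(gushing)), is_suspicious(gushing))
print(sentiment_label(sentiment_score(balanced)), is_suspicious(balanced))
```

In a fuller pipeline, the score would be one input among several to a trained classifier rather than a rule on its own.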
2- Feature engineering on Yelp Restaurant and Hotel reviews
Researchers applied feature engineering to preprocessed data from two datasets, Yelp Restaurant and Yelp Hotel online reviews. They compared various ML models on these datasets and found that logistic regression performed better than the other algorithms, reaching 88% accuracy in detecting fake reviews.6
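For readers unfamiliar with logistic regression, the sketch below trains one from scratch with batch gradient descent on a toy dataset. The two features (an exclamation rate and a reviews-per-day rate, both scaled to the range 0 to 1), the six data points, and the hyperparameters are invented for illustration and have nothing to do with the Yelp datasets or the study's engineered features.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_logistic_regression(X, y, lr=0.5, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n_features = len(X[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for row, label in zip(X, y):
            z = sum(w * x for w, x in zip(weights, row)) + bias
            error = sigmoid(z) - label  # derivative of log loss w.r.t. z
            for j, x in enumerate(row):
                grad_w[j] += error * x
            grad_b += error
        weights = [w - lr * g / len(X) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(X)
    return weights, bias

def predict(weights, bias, row):
    """Return 1 (fake) if the predicted probability is at least 0.5."""
    z = sum(w * x for w, x in zip(weights, row)) + bias
    return 1 if sigmoid(z) >= 0.5 else 0

# Toy data: [exclamation_rate, reviews_per_day_rate], label 1 = fake
X = [[0.9, 0.8], [0.8, 0.9], [0.7, 0.7], [0.1, 0.2], [0.2, 0.1], [0.3, 0.2]]
y = [1, 1, 1, 0, 0, 0]
weights, bias = train_logistic_regression(X, y)
predictions = [predict(weights, bias, row) for row in X]
print(predictions)
```

On real data you would hold out a test set and use an established library rather than a hand-written loop. The point here is only to show what the model learns: a weighted sum of features passed through a sigmoid.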
3- Classification of fake reviews on the App Store
Researchers used the Apple App Store dataset containing 22+ million reviews from 1.4 million apps to detect fake reviews. Results show that ~66 million (35% of all reviews) were fake.7 Among those, 60,000 were written by a single spammer.
Real-life applications of how companies fight against fake reviews
Yelp fake reviews consumer alert
Source: Yelp Blog
Figure 5. Example of a fake review alert on Yelp
Yelp detects sellers who buy fake reviews and then warns potential customers about their fraudulent actions. The aim is to shame sellers who pay online spammers to write positive reviews for their brands.
Amazon files suit against those soliciting fake reviews on Facebook
Amazon has 12,000+ employees working on fraud and abuse. In 2022, the company discovered 10,000 Facebook groups created to solicit fake reviews in exchange for money or free products.8 It announced that it had taken proactive legal action to remove the groups and find the bad actors.
Further Reading
- Ad Fraud: What Is It, How It Works, & How to Combat It?
- The Ultimate Guide to Avoiding CAPTCHAs in Web Scraping
- Generative AI Ethics: Top 6 Concerns
External Links
- 1. “Distribution of online fake reviews that were removed in 2021, by star rating” Statista. Retrieved September 19, 2023.
- 2. Plotkina, D., Munzel, A., & Pallud, J. (2020). “Illusions of truth—Experimental insights into human and algorithmic detections of fake online reviews.” Journal of Business Research, 109, 511-523. Retrieved January 17, 2023.
- 3. “Total number of user reviews and opinions on Tripadvisor worldwide from 2014 to 2021.” Statista. February 21, 2022. Retrieved January 17, 2023.
- 4. Li, J., Lv, P., Xiao, W., Yang, L., & Zhang, P. (2021). “Exploring groups of opinion spam using sentiment analysis guided by nominated topics.” Expert Systems with Applications, 171, 114585. Retrieved January 17, 2023.
- 5. Saumya, S., & Singh, J. P. (2018). “Detection of spam reviews: a sentiment analysis approach.” CSI Transactions on ICT, 6(2), 137-148. Retrieved January 17, 2023.
- 6. Jain, P. K., Pamula, R., & Ansari, S. (2021). “A supervised machine learning approach for the credibility assessment of user-generated content.” Wireless Personal Communications, 118(4), 2469-2485. Retrieved January 17, 2023.
- 7. Martens, D., & Maalej, W. (2019). “Towards understanding and detecting fake reviews in app stores.” Empirical Software Engineering, 24(6), 3316-3355. Retrieved January 17, 2023.
- 8. “Amazon targets fake review fraudsters on social media.” Amazon. July 19, 2022. Retrieved January 17, 2023.
Cem has been the principal analyst at AIMultiple since 2017. AIMultiple informs hundreds of thousands of businesses (as per similarWeb) including 60% of Fortune 500 every month.
Cem's work has been cited by leading global publications including Business Insider, Forbes, Washington Post, global firms like Deloitte, HPE, NGOs like World Economic Forum and supranational organizations like European Commission. You can see more reputable companies and media that referenced AIMultiple.
Throughout his career, Cem served as a tech consultant, tech buyer and tech entrepreneur. He advised businesses on their enterprise software, automation, cloud, AI / ML and other technology related decisions at McKinsey & Company and Altman Solon for more than a decade. He also published a McKinsey report on digitalization.
He led technology strategy and procurement of a telco while reporting to the CEO. He has also led commercial growth of deep tech company Hypatos that reached a 7 digit annual recurring revenue and a 9 digit valuation from 0 within 2 years. Cem's work in Hypatos was covered by leading technology publications like TechCrunch and Business Insider.
Cem regularly speaks at international technology conferences. He graduated from Bogazici University as a computer engineer and holds an MBA from Columbia Business School.