Skip to content
English library

SNAPAmazon

Offering a broad and deep view into consumer sentiments and opinions.

Play icon crypto ? lives reviews SNAP

Dataset Overview

This dataset compiles 34,686,770 Amazon reviews from 6,643,669 users across 2,441,053 products, offering deep insights into consumer sentiments.

Data Origin

Derived from the Stanford Network Analysis Project (SNAP), this dataset spans an 18-year period up to March 2013, providing a broad historical perspective on consumer opinions.

Sentiment Classification

Reviews are categorized into negative and positive sentiments based on their ratings, with scores of 1-2 considered negative and 4-5 positive, excluding neutral reviews.

Dataset Structure

The dataset is meticulously structured, featuring 1,800,000 training samples and 200,000 testing samples for each sentiment category, designed for polarity analysis.

File Format

Provided in CSV format, each file includes three columns: polarity, title, and text, representing the sentiment classification, review heading, and body of the review, respectively.

Usage Guidelines

Researchers and developers can utilize this dataset to analyze trends, perform sentiment analysis, and develop machine learning models to better understand consumer behavior.

Find the plan that's right for you, each plan includes

docs iconsDocs
sheets iconsSheets
slides iconsslides
forms iconsforms
keep iconskeep
sites iconssites
drive iconsdrive
gmail iconsgmail
meet iconsmeet
calendar iconscalendar
Chat_icon@1x iconsChat
docusaurus_keytar iconsjup
docusaurus iconsBusiness
GoogleMaps iconsGoogleMaps

Released under the MIT License.

reviews has loaded