GOAT: Your
Real-World
Electronics &
D2C
Data Playground
GOAT is a ready-to-use synthetic dataset that mimics the experience of a real direct-to-consumer electronics brand. Ideal for demand forecasting, customer segmentation, product recommendation engines, and consumer behavior analysis.
Overview
GOAT is a synthetic dataset designed to mirror the operations of a direct-to-consumer electronics brand, focusing on products like audio gear, wearables, and smart accessories. It offers detailed data on users, transactions, product listings, reviews, inventory, and support tickets, providing a rich environment for analyzing consumer behavior, marketing strategies, and post-purchase interactions. Whether you're working on demand forecasting, A/B testing, or campaign performance, GOAT offers a realistic framework for exploring these use cases.
Ideal for data scientists, developers, and analysts, GOAT supports a variety of applications, from machine learning tasks like customer lifetime value modeling and product recommendation to backend testing of inventory systems and APIs. It also provides an excellent resource for learners to practice SQL, data transformations, and ETL workflows. With its comprehensive and structured data, GOAT allows you to dive into customer behavior, sales trends, and operational optimization in the consumer electronics space.
Full D2C Electronics Lifecycle
Simulates a complete D2C electronics brand experience, including product catalog, purchases, fulfillment, customer feedback, warranty handling, and returns.
Built for Development & Testing
Excellent for building and testing customer segmentation models, sales prediction engines, return forecasting systems, and marketing funnel optimization pipelines.
Backend Feature Testing
Useful for developers working on inventory sync, order status workflows, logistics integration, and payment processing systems.
Rich Transaction Data
Supports use cases in consumer behavior analysis, seasonal product trend tracking, influencer impact studies, and marketing funnel optimization.
Analytics & Research
Suitable for SQL practice, data transformations, ETL workflows, A/B testing simulations, and product analytics in the consumer electronics space.
How it Works
AI-Generated & Fully Synthetic
The GOAT dataset is a synthetic representation of an e-commerce platform focused on audio and tech accessories. Using AI agents trained on digital commerce behaviors, the data captures natural interactions between users and a product catalog — with zero real transactions or consumer data.
Realistic Simulation with Privacy
It simulates product listings, image galleries, order placements, user reviews, discount applications, and payment workflows — without any real transactions or consumer data, ensuring ethical use across all e-commerce applications.
High-Quality & Safe for Use
Built for developers and analysts to test e-commerce platforms, especially those involving electronics and consumer goods in a fully synthetic environment — 100% privacy-compliant and ready to use.
Dataset Schema
A comprehensive relational model representing a modern D2C electronics e-commerce platform engineered for deep analysis and complex querying.
Users
Stores user data like login details, contact info, and role (customer/admin), linked to orders, payments, and reviews.
Products
Contains product details such as name, description, price, stock, and category (headphones, earphones, speakers).
Product Images
Stores images for products (front, back, side views) linked to the respective products.
Orders
Tracks customer orders, including total amount, status (pending, completed), and shipping and billing info.
Order Items
Details individual items in an order, including product, quantity, unit price, and total price.
Payments
Stores payment details for orders, including method, amount, status (pending, completed), and transaction reference.
Reviews
Records customer reviews for products, including ratings (1-5 stars) and feedback, linked to users and products.
Discounts
Defines discount codes with percentage and validity period for promotions.
User Activities
Logs user actions like viewing products, adding items to cart, and making orders/payments.
Notifications
Stores notifications for users (order updates, offers) with read/unread status for engagement.
- CSV
- JSON
- Excel
- MySQL
- PostgreSQL
- SQL Server
- Snowflake