Premium Synthetic Dataset

GOAT: Your
Real-World
Electronics & D2C Data Playground

GOAT is a ready-to-use synthetic dataset that mimics the experience of a real direct-to-consumer electronics brand. Ideal for demand forecasting, customer segmentation, product recommendation engines, and consumer behavior analysis.

GOAT mascot

Overview

GOAT is a synthetic dataset designed to mirror the operations of a direct-to-consumer electronics brand, focusing on products like audio gear, wearables, and smart accessories. It offers detailed data on users, transactions, product listings, reviews, inventory, and support tickets, providing a rich environment for analyzing consumer behavior, marketing strategies, and post-purchase interactions. Whether you're working on demand forecasting, A/B testing, or campaign performance, GOAT offers a realistic framework for exploring these use cases.


Ideal for data scientists, developers, and analysts, GOAT supports a variety of applications, from machine learning tasks like customer lifetime value modeling and product recommendation to backend testing of inventory systems and APIs. It also provides an excellent resource for learners to practice SQL, data transformations, and ETL workflows. With its comprehensive and structured data, GOAT allows you to dive into customer behavior, sales trends, and operational optimization in the consumer electronics space.



Full D2C Electronics Lifecycle

Simulates a complete D2C electronics brand experience, including product catalog, purchases, fulfillment, customer feedback, warranty handling, and returns.

Built for Development & Testing

Excellent for building and testing customer segmentation models, sales prediction engines, return forecasting systems, and marketing funnel optimization pipelines.

Backend Feature Testing

Useful for developers working on inventory sync, order status workflows, logistics integration, and payment processing systems.

Rich Transaction Data

Supports use cases in consumer behavior analysis, seasonal product trend tracking, influencer impact studies, and marketing funnel optimization.

Analytics & Research

Suitable for SQL practice, data transformations, ETL workflows, A/B testing simulations, and product analytics in the consumer electronics space.

How it Works

01

AI-Generated & Fully Synthetic

The GOAT dataset is a synthetic representation of an e-commerce platform focused on audio and tech accessories. Using AI agents trained on digital commerce behaviors, the data captures natural interactions between users and a product catalog — with zero real transactions or consumer data.

02

Realistic Simulation with Privacy

It simulates product listings, image galleries, order placements, user reviews, discount applications, and payment workflows — without any real transactions or consumer data, ensuring ethical use across all e-commerce applications.

03

High-Quality & Safe for Use

Built for developers and analysts to test e-commerce platforms, especially those involving electronics and consumer goods in a fully synthetic environment — 100% privacy-compliant and ready to use.

Dataset Schema

A comprehensive relational model representing a modern D2C electronics e-commerce platform engineered for deep analysis and complex querying.

Users

Stores user data like login details, contact info, and role (customer/admin), linked to orders, payments, and reviews.

Products

Contains product details such as name, description, price, stock, and category (headphones, earphones, speakers).

Product Images

Stores images for products (front, back, side views) linked to the respective products.

Orders

Tracks customer orders, including total amount, status (pending, completed), and shipping and billing info.

Order Items

Details individual items in an order, including product, quantity, unit price, and total price.

Payments

Stores payment details for orders, including method, amount, status (pending, completed), and transaction reference.

Reviews

Records customer reviews for products, including ratings (1-5 stars) and feedback, linked to users and products.

Discounts

Defines discount codes with percentage and validity period for promotions.

User Activities

Logs user actions like viewing products, adding items to cart, and making orders/payments.

Notifications

Stores notifications for users (order updates, offers) with read/unread status for engagement.

Available formats
  • CSV
  • JSON
  • Excel
Supported databases
  • MySQL
  • PostgreSQL
  • SQL Server
Cloud access
  • Snowflake