Every Black Friday, millions of shoppers refresh their browsers hoping to catch that perfect deal. But what if you could stop refreshing entirely and let a system watch prices for you around the clock? CamelCamelCamel does exactly this for over 18 million Amazon products, sending alerts the instant prices drop below your threshold.
The engineering challenge behind this seemingly simple service involves continuous data collection from one of the world’s largest e-commerce platforms. It also requires storing billions of price points efficiently and delivering notifications within seconds of a price change.
This guide breaks down the complete architecture of a CamelCamelCamel-style system. You will follow the journey from the moment a price changes on Amazon to the notification landing in your inbox. You will learn how to design scrapers that respect rate limits while maintaining data freshness. You will also explore time-series databases optimized for write-heavy workloads and event-driven alerting pipelines that scale to millions of users without sending duplicate notifications.
Whether you are preparing for System Design interviews or building your own tracking service, these patterns form the foundation of modern data-intensive applications.
The following diagram illustrates the high-level architecture of a price tracking system, showing how data flows from Amazon through collection, storage, and notification layers to reach end users.
Understanding what the system actually does
Before diving into databases and message queues, you need clarity on what users expect from a price tracking platform. CamelCamelCamel serves three primary functions that drive every architectural decision.
First, it displays price history charts showing how a product’s cost has fluctuated over months or years, helping users identify patterns and optimal buying windows. Second, it sends price drop alerts when products fall below user-defined thresholds, requiring near real-time processing of incoming price data. Third, it provides historical trend analysis so users can determine whether current prices represent genuine deals or artificial inflation before a sale.
The technical challenge emerges from the scale and frequency of these operations. Unlike typical web applications where traffic drives system load, price tracking operates continuously regardless of user activity. Scrapers run around the clock, price comparisons execute on every update, and storage grows whether anyone visits the site or not. This background-heavy workload fundamentally shapes the architecture and pushes the system toward an event-driven architecture where components react to data changes rather than user requests.
Real-world context: CamelCamelCamel tracks approximately 18 million Amazon products across multiple regional storefronts, with each product potentially updating multiple times daily. This translates to roughly 50-100 million price checks per day during normal operations, with traffic spiking 5-10x during Prime Day and Black Friday events.
Functional requirements that shape the design
Every System Design begins with functional expectations that define what the system must accomplish. The system must fetch product data reliably from Amazon’s Product Advertising API or through ethical web scraping, handling rate limits and temporary failures gracefully. It must store and index historical pricing data in a format optimized for both rapid writes during ingestion and efficient reads during chart rendering.
Alert functionality requires comparing incoming prices against potentially millions of user-defined thresholds. It also requires triggering notifications through email, push, or webhook channels within seconds of detecting a match.
The frontend requirements add another dimension of complexity, typically addressed through server-side versus client-side rendering trade-offs. Users expect price charts to load instantly even when displaying years of data points, and product searches should return results quickly across millions of items.
Watchlists and alert configurations must persist reliably. Supporting user profiles means maintaining separate data stores optimized for different access patterns. User preferences change rarely but price history grows constantly, creating a natural separation between hot and cold data paths.
Non-functional requirements that determine success
Non-functional goals define how the system behaves under stress and at scale, often determining whether the architecture succeeds or fails in production. Scalability is paramount since the system must handle millions of products and thousands of concurrent users without degradation. Availability ensures users can access data even during high-traffic periods like Prime Day when interest in price tracking peaks.
Performance targets specify that API responses should complete in under 200 milliseconds for cached data. Notifications should reach users within seconds of a qualifying price change.
Reliability guarantees data accuracy and prevents missing updates that could cause users to miss deals they were tracking. Cost efficiency matters significantly because the system generates revenue primarily through affiliate links, meaning infrastructure costs directly impact profitability. A system that tracks prices accurately but costs more to operate than it generates in revenue serves no business purpose.
These requirements create a foundation for decisions ranging from database selection to caching strategy. Understanding them deeply enables intelligent trade-offs throughout the design process.
How data flows through the architecture
Every high-performing system starts with a clear understanding of how data moves through it. In price tracking, data begins its journey at Amazon’s product listings and ends at the user interface where charts, alerts, and analytics appear. The path between these endpoints involves multiple processing stages, each designed to handle specific transformations while maintaining throughput and reliability. This event-driven architecture allows each layer to scale independently and fail gracefully without cascading failures.
The following diagram shows the end-to-end data flow from Amazon product pages through the collection, processing, and serving layers to user notifications and frontend displays.
The collection phase begins when scraper services retrieve the latest prices and product metadata from Amazon. Each request gets logged and timestamped to track update frequency and identify stale data. The collected information then moves to a message queue like Kafka or RabbitMQ, which decouples ingestion from processing and allows each layer to scale independently.
This queueing approach absorbs bursts during peak collection periods and ensures no price updates are lost if downstream services temporarily slow down.
Watch out: Synchronous data flows create dangerous coupling between system components. If your processing service slows down and you lack a queue buffer, backpressure propagates upstream and can cause scraper failures or data loss during traffic spikes. Always design with asynchronous boundaries between major components.
Worker nodes consume messages from the queue and perform the heavy lifting of data transformation. They clean incoming data by removing duplicates, handle missing values through interpolation or flagging, and standardize product identifiers across different Amazon storefronts. The processed data then writes to persistent storage, with price history going to time-series databases while metadata and user alerts live in relational or document databases.
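A minimal worker sketch of this consume-and-normalize step, assuming a Kafka topic named `price-updates` and the kafka-python client; the topic name, message fields, and normalization rules are illustrative, not CamelCamelCamel's actual pipeline:

```python
import json
from kafka import KafkaConsumer  # assumes the kafka-python package

consumer = KafkaConsumer(
    "price-updates",                      # hypothetical topic name
    bootstrap_servers=["localhost:9092"],
    group_id="price-normalizers",
    value_deserializer=lambda raw: json.loads(raw.decode("utf-8")),
    enable_auto_commit=False,             # commit only after a successful write
)

def write_to_timeseries_store(record: dict) -> None:
    # Placeholder: in production this would insert into the time-series database.
    print("stored", record)

def normalize(event: dict) -> dict | None:
    """Drop malformed events and standardize fields before storage."""
    price = event.get("price")
    if price is None or float(price) <= 0:
        return None                       # skip or flag missing/bogus prices
    return {
        "asin": event["asin"].upper(),    # standardize product identifiers
        "price_cents": round(float(price) * 100),
        "currency": event.get("currency", "USD"),
        "observed_at": event["observed_at"],
        "source": event.get("source", "api"),
    }

for message in consumer:
    record = normalize(message.value)
    if record is not None:
        write_to_timeseries_store(record)
    consumer.commit()                     # acknowledge only after processing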
When users request product pages, the backend checks cache layers before hitting the database. Cached results drastically reduce response times for popular products.
The final stage involves serving processed data to users and triggering notifications when appropriate. The API layer retrieves requested information and transforms it into the format needed for frontend chart rendering. Meanwhile, the alerting service monitors incoming prices against user thresholds and publishes notification events when matches occur.
This parallel processing ensures that user requests receive fast responses while alert evaluation happens independently in the background. The decoupled microservices architecture allows each layer to fail and recover independently while maintaining high availability.
Building the scraping and data collection layer
The heart of any price tracking system lies in its data collection mechanism, which determines whether price data remains accurate and current. CamelCamelCamel relies on two primary data sources that complement each other based on availability and rate limits.
The Amazon Product Advertising API provides the official and reliable method for fetching product details and prices. It comes with strict rate limits and authentication requirements that constrain how quickly you can gather data. Web scraping serves as a backup strategy when API limits are exhausted or for gathering additional data like customer reviews that the API does not expose. It must comply with Amazon’s robots.txt directives and terms of service.
A production scraper setup requires multiple coordinated components working together in a microservices architecture. The scheduler service triggers crawling jobs at configured intervals, prioritizing products based on factors like popularity, price volatility, and time since last update.
Products with historically stable prices might update every 12 hours while frequently changing items update every 2-4 hours. This adaptive scheduling represents a key trade-off between data freshness and operational cost. Distributed crawler instances handle the actual fetching, with each worker responsible for a subset of products to enable horizontal scaling. A proxy layer rotates IP addresses to avoid rate limiting while managing region-specific requests for different Amazon storefronts.
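A small sketch of how an adaptive scheduler might map volatility and popularity onto a refresh interval; the weights and bounds here are illustrative assumptions, not tuned production values:

```python
def next_check_interval_hours(volatility: float, popularity: float) -> float:
    """
    volatility: 0-1, fraction of recent checks where the price moved.
    popularity: 0-1, normalized watch/traffic count for the product.
    Volatile or popular products are rechecked roughly every 2 hours;
    stable, obscure products roughly every 12 hours.
    """
    base_hours = 12.0
    fastest_hours = 2.0
    urgency = max(volatility, popularity)   # either signal can shorten the interval
    return max(fastest_hours, base_hours * (1.0 - 0.85 * urgency))

print(next_check_interval_hours(0.05, 0.1))  # ~10.8 hours for a quiet product
print(next_check_interval_hours(0.9, 0.3))   # 2 hours for a volatile one
```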
Pro tip: Implement content hashing on scraped data before database comparison. A simple MD5 hash of the price, availability, and key metadata fields lets you detect changes with a single comparison rather than field-by-field checks. This dramatically improves processing throughput and reduces unnecessary database writes by 60-80% during stable price periods.
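A possible implementation of that tip using only the standard library; the chosen fields and the in-memory `last_seen` lookup stand in for whatever store the real system would use:

```python
import hashlib
import json

def content_hash(price_cents: int, in_stock: bool, metadata: dict) -> str:
    """Stable MD5 over the fields that matter for change detection."""
    payload = json.dumps(
        {"price": price_cents, "in_stock": in_stock, "meta": metadata},
        sort_keys=True,               # canonical ordering so equal content hashes equally
        separators=(",", ":"),
    )
    return hashlib.md5(payload.encode("utf-8")).hexdigest()

last_seen: dict[str, str] = {}        # asin -> last content hash (stand-in for a DB/Redis lookup)

def has_changed(asin: str, price_cents: int, in_stock: bool, metadata: dict) -> bool:
    digest = content_hash(price_cents, in_stock, metadata)
    if last_seen.get(asin) == digest:
        return False                  # identical content: skip the database write
    last_seen[asin] = digest
    return True
```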
Rate limiting requires careful implementation using token-bucket or leaky-bucket algorithms to ensure compliance with API quotas. When requests fail, exponential backoff policies automatically retry with increasing delays to avoid overwhelming already-stressed endpoints.
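A minimal sketch of both patterns, a token bucket plus retries with exponential backoff and jitter, assuming nothing beyond the standard library:

```python
import random
import time

class TokenBucket:
    """Simple token-bucket limiter: `rate` tokens per second, bursting up to `capacity`."""

    def __init__(self, rate: float, capacity: int):
        self.rate = rate
        self.capacity = capacity
        self.tokens = float(capacity)
        self.last_refill = time.monotonic()

    def acquire(self) -> None:
        while True:
            now = time.monotonic()
            self.tokens = min(self.capacity, self.tokens + (now - self.last_refill) * self.rate)
            self.last_refill = now
            if self.tokens >= 1:
                self.tokens -= 1
                return
            time.sleep((1 - self.tokens) / self.rate)   # wait until a token becomes available

def fetch_with_backoff(fetch, max_attempts: int = 5):
    """Retry a flaky call with exponential backoff and jitter."""
    for attempt in range(max_attempts):
        try:
            return fetch()
        except Exception:
            if attempt == max_attempts - 1:
                raise
            time.sleep((2 ** attempt) + random.random())  # 1s, 2s, 4s... plus jitter
```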
Change detection through checksum-based comparisons allows the system to update storage only when data actually differs from previous records, significantly reducing database load. For a system checking tens of millions of products every few hours, eliminating redundant writes cuts storage operations dramatically during stable price periods.
Legal and ethical considerations deserve particular attention when building a sustainable scraping system. Amazon’s terms of service and robots.txt file define what automated access they permit. Violating these terms risks IP bans, legal action, or complete API access revocation.
Responsible scraping involves respecting crawl delays specified in robots.txt, identifying your bot with a clear user-agent string, and avoiding aggressive request patterns that could impact Amazon’s servers. Proxy infrastructure requires careful management. Rotating through residential proxies helps avoid detection, but using proxies from regions that do not match your target storefront can return incorrect prices due to geographic pricing variations.
A hybrid approach using API calls as the primary source and scraping only for supplementary data provides the most sustainable long-term strategy. With reliable data collection established, the next challenge involves storing billions of price records efficiently.
Designing the database layer for time-series data
The database forms the backbone of any large-scale tracking system, and price tracking presents a specific challenge that general-purpose databases handle poorly. The system generates billions of price records over time, with write patterns that spike during scraping cycles and read patterns that vary based on user activity. The choice of storage technology and schema design directly impacts scalability, query speed, and operational cost, making database selection one of the most consequential architectural decisions.
Three primary entities drive the data model with distinct access patterns. The product table stores relatively static information including the Amazon Standard Identification Number (ASIN), product name, category, image URLs, and metadata timestamps. It changes infrequently and benefits from traditional relational storage with strong indexing.
The price history table captures the time-series data that represents the system’s core value. It stores product identifiers, prices, currencies, timestamps, and source indicators for each observation. It grows continuously and requires storage optimized for append-heavy workloads. The user alert table maintains the relationship between users, products, and their threshold configurations, including alert status and trigger history.
The following diagram illustrates the database schema showing core entities, their relationships, and how data partitioning applies across different storage tiers.
Historical note: Early price tracking systems often used standard relational databases for all data, leading to severe performance problems as history tables grew. The emergence of purpose-built time-series databases like InfluxDB (2013) and TimescaleDB (2017) fundamentally changed how systems handle this workload pattern. They offer automatic partitioning and compression while maintaining query performance.
Database technology selection follows naturally from these access patterns. Relational databases like PostgreSQL excel for structured data with complex relationships, making them ideal for user profiles, product metadata, and alert configurations. Time-series databases such as TimescaleDB, InfluxDB, or Apache Cassandra optimize for the high-volume append operations and time-range queries that price history requires.
TimescaleDB offers particular advantages since it extends PostgreSQL with automatic partitioning and compression while maintaining SQL compatibility. This reduces the operational complexity of managing separate database systems. The trade-off between SQL and NoSQL options depends on query complexity requirements and operational expertise.
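A sketch of what the price-history table could look like as a TimescaleDB hypertable, assuming the psycopg2 driver, a local connection string, and the TimescaleDB extension available on the server; the column names follow the entities described above rather than any real production schema:

```python
import psycopg2

DDL = """
CREATE EXTENSION IF NOT EXISTS timescaledb;

CREATE TABLE IF NOT EXISTS price_history (
    asin        TEXT        NOT NULL,
    price_cents INTEGER     NOT NULL,
    currency    CHAR(3)     NOT NULL DEFAULT 'USD',
    source      TEXT        NOT NULL,
    observed_at TIMESTAMPTZ NOT NULL
);

-- Turn the table into a hypertable automatically partitioned by time.
SELECT create_hypertable('price_history', 'observed_at', if_not_exists => TRUE);

-- Composite index matching the dominant query: one product over a date range.
CREATE INDEX IF NOT EXISTS idx_price_history_asin_time
    ON price_history (asin, observed_at DESC);
"""

with psycopg2.connect("postgresql://localhost/pricetracker") as conn:  # assumed DSN
    with conn.cursor() as cur:
        cur.execute(DDL)
```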
Storage optimization strategies
Storing years of price history for millions of products demands aggressive optimization to control costs and maintain query performance. Partitioning splits tables by time periods or product categories, allowing queries to scan only relevant data segments rather than entire tables.
A common approach partitions price history by month, enabling fast queries for recent data while allowing older partitions to move to cheaper storage tiers. Sharding distributes data across multiple database nodes based on a partition key, with product-based sharding assigning each product’s price history to a specific shard based on a hash of its identifier.
Compression reduces storage requirements significantly. Columnar formats achieve 10-20x compression ratios on time-series data through techniques like delta encoding where consecutive timestamps store only their differences.
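A tiny illustration of delta encoding on timestamps; real columnar engines apply this (along with run-length and dictionary encoding) internally, so this is only a demonstration of the idea:

```python
def delta_encode(timestamps: list[int]) -> list[int]:
    """Store the first timestamp, then only the gaps between consecutive ones."""
    return [timestamps[0]] + [b - a for a, b in zip(timestamps, timestamps[1:])]

def delta_decode(deltas: list[int]) -> list[int]:
    out = [deltas[0]]
    for gap in deltas[1:]:
        out.append(out[-1] + gap)
    return out

# Regular sampling turns large epoch values into tiny, highly compressible gaps.
print(delta_encode([1700000000, 1700000300, 1700000600]))  # [1700000000, 300, 300]
```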
Data aggregation provides another powerful optimization by pre-computing summary statistics at multiple time granularities. Raw price observations might retain minute-level precision for the past week, hourly averages for the past month, and daily summaries for older data. This tiered approach reduces storage requirements by orders of magnitude while still enabling accurate chart rendering at any zoom level.
| Storage tier | Time range | Granularity | Storage type | Compression ratio |
|---|---|---|---|---|
| Hot | 0-7 days | Per-update (minutes) | TimescaleDB SSD | 5x |
| Warm | 7-90 days | Hourly averages | TimescaleDB HDD | 15x |
| Cold | 90+ days | Daily summaries | S3 Parquet files | 25x |
Indexing strategy requires balancing read performance against write overhead. Composite indexes on product identifier and timestamp accelerate the most common query pattern of fetching price history for a specific product within a date range. However, each index adds overhead to write operations, so indexing only the columns that appear in frequent query predicates keeps ingestion performance high.
For time-series data, the timestamp column often serves as the primary index through partitioning, making additional timestamp indexes redundant. The database layer establishes how data persists, but users interact through APIs that must respond quickly regardless of underlying data volume.
API design and frontend integration
At this stage, the backend collects and stores price data efficiently, but users experience the system through frontend interfaces and APIs, where responsiveness determines satisfaction. The API layer mediates between storage systems optimized for data management and a user experience optimized for speed and usability. Well-structured REST APIs enable smooth communication while hiding the complexity of the underlying distributed system, and an API gateway centralizes routing, throttling, and authentication concerns.
The core API endpoints reflect the primary user interactions. A product endpoint returns current information and latest price for a given ASIN, drawing from cached data for popular items. The history endpoint fetches price time-series data for chart rendering, with parameters controlling the date range and aggregation level.
Alert endpoints handle CRUD operations for user notification preferences, while user endpoints manage watchlists and account settings. Each endpoint maintains stateless operation to enable horizontal scaling of API servers behind load balancers. Alert latency SLA targets typically specify delivery within 60 seconds of a qualifying price change.
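A sketch of the history endpoint using FastAPI; the route shape, parameter names, and the stubbed data-access helper are assumptions for illustration:

```python
from datetime import date
from enum import Enum

from fastapi import FastAPI, HTTPException

app = FastAPI()

class Granularity(str, Enum):
    raw = "raw"
    hourly = "hourly"
    daily = "daily"

def query_price_history(asin: str, start: date, end: date, granularity: str) -> list[dict]:
    # Stand-in for the real time-series query against the storage tiers above.
    return [{"t": str(start), "price_cents": 4999}]

@app.get("/products/{asin}/history")
def price_history(
    asin: str,
    start: date,
    end: date,
    granularity: Granularity = Granularity.daily,
):
    """Price points for one product over a date range, pre-aggregated for chart rendering."""
    if start > end:
        raise HTTPException(status_code=400, detail="start must be before end")
    points = query_price_history(asin, start, end, granularity.value)
    return {"asin": asin, "granularity": granularity.value, "points": points}
```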
Pro tip: Implement API response compression at the gateway level rather than in application code. Services like NGINX or Kong handle compression more efficiently and consistently across all endpoints without requiring changes to individual service implementations. This reduces bandwidth by 60-80% for JSON responses.
Performance optimization at the API layer involves multiple strategies working together. Response caching using Redis stores frequently requested data like popular product histories, with cache invalidation triggered by price updates. Pagination prevents massive response payloads when users have extensive watchlists or when searches return thousands of results.
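A cache-aside sketch using the redis-py client; the key naming, 30-minute TTL, and the stubbed database helper are assumptions:

```python
import json
import redis

cache = redis.Redis(host="localhost", port=6379)
HISTORY_TTL_SECONDS = 30 * 60   # assumed 30-minute TTL; tune to the scrape frequency

def query_price_history_from_db(asin: str, range_key: str) -> list[dict]:
    # Stand-in for the real time-series query.
    return [{"t": "2024-01-01", "price_cents": 4999}]

def cached_history(asin: str, range_key: str) -> list[dict]:
    key = f"history:{asin}:{range_key}"
    hit = cache.get(key)
    if hit is not None:
        return json.loads(hit)                            # cache hit: skip the database entirely
    points = query_price_history_from_db(asin, range_key)
    cache.set(key, json.dumps(points), ex=HISTORY_TTL_SECONDS)
    return points
```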
Response compression using gzip or Brotli reduces bandwidth requirements, particularly important for mobile users on slower connections. Content delivery networks cache static assets and even some API responses at edge locations closer to users, reducing latency for geographically distributed audiences.
Frontend architecture decisions significantly impact both performance and SEO. The choice between server-side rendering and client-side rendering affects how search engines index product pages. SSR provides better SEO since pre-rendered pages with price history metadata can rank for users searching for specific product prices.
Frameworks like Next.js enable hybrid approaches where product pages render server-side for initial load while subsequent interactions happen client-side. Chart rendering presents its own challenge since displaying years of daily prices means potentially thousands of data points per visualization. Libraries like Chart.js handle rendering while the API returns data in formats optimized for visualization rather than raw database records.
Pre-aggregation at appropriate granularities combined with progressive loading creates responsive interactions even for extensive price histories. Fast APIs matter little if users miss the price drops they were tracking, making the notification system the most time-sensitive component.
Building the notification and alerting system
Price alerts represent the feature that drives user engagement and retention, making the alerting system’s reliability and speed critical to business success. When a user sets a price threshold, they expect a notification within seconds of a qualifying price drop, not minutes or hours later when the deal may already have expired. The alerting architecture must process millions of user thresholds against incoming price updates while maintaining delivery guarantees and avoiding notification spam that would train users to ignore alerts entirely.
The following diagram illustrates the alert evaluation pipeline, showing how price updates trigger threshold comparisons and flow through the notification delivery system.
The alert workflow begins when a price update arrives from the processing pipeline. Rather than checking every user alert against every price update, the system maintains an inverted index mapping product identifiers to interested users. When Product X’s price updates, the system retrieves only the alerts configured for Product X and evaluates each threshold.
This targeted approach reduces the evaluation space dramatically, turning a potentially quadratic problem into a linear one. For a system with 10 million products and 50 million alerts, checking only relevant alerts per update reduces comparisons from 500 trillion to an average of five per product.
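An in-memory sketch of the inverted index; a production system would keep this mapping in Redis or a database, but the lookup shape is the same:

```python
from collections import defaultdict

# Inverted index: product identifier -> alerts watching that product.
alerts_by_asin: dict[str, list[dict]] = defaultdict(list)

def register_alert(user_id: str, asin: str, threshold_cents: int) -> None:
    alerts_by_asin[asin].append({"user_id": user_id, "threshold_cents": threshold_cents})

def matching_alerts(asin: str, new_price_cents: int) -> list[dict]:
    """On a price update, evaluate only the alerts registered for that product."""
    return [a for a in alerts_by_asin.get(asin, []) if new_price_cents <= a["threshold_cents"]]

register_alert("u1", "B000EXAMPLE", 5000)
register_alert("u2", "B000EXAMPLE", 4500)
print(matching_alerts("B000EXAMPLE", 4800))   # only u1's threshold is crossed
```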
Watch out: Alert evaluation must happen on price updates, not user requests. A common mistake involves checking thresholds only when users view their alerts, which misses deals that occur between visits and defeats the purpose of automated tracking. Event-driven evaluation ensures immediate detection regardless of user activity.
Message queues provide the reliability backbone for alert processing. When a price triggers an alert, the system publishes an event to a notification queue rather than sending the notification directly. This decoupling allows the alerting service to acknowledge price updates quickly while notification delivery happens asynchronously.
If the email service experiences temporary problems, messages wait in the queue rather than being lost. Dead-letter queues capture messages that fail repeatedly for manual investigation and replay after issues resolve.
Preventing notification spam requires careful debounce logic implementation. Users who set a $50 threshold for a product do not want notifications every time the price drops below that threshold. If a product settles at $45, the system should send one notification, not repeated alerts every time the scraper confirms the same price.
Debouncing tracks when each alert last triggered and suppresses duplicate notifications within a configurable window, typically 24 hours. Additional spam prevention detects trivial price fluctuations. Amazon prices sometimes oscillate by pennies multiple times daily due to algorithmic repricing. Implementing a hysteresis buffer where prices must drop significantly below the threshold before triggering eliminates nuisance notifications.
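A sketch combining both safeguards; the 24-hour window and 2% hysteresis buffer are assumed values, not CamelCamelCamel's actual settings:

```python
import time

DEBOUNCE_SECONDS = 24 * 60 * 60          # assumed 24-hour suppression window
HYSTERESIS_FRACTION = 0.02               # price must undercut the threshold by 2%

last_fired: dict[tuple[str, str], float] = {}   # (user_id, asin) -> last notification time

def should_notify(user_id: str, asin: str, threshold_cents: int, price_cents: int) -> bool:
    if price_cents > threshold_cents * (1 - HYSTERESIS_FRACTION):
        return False                      # not meaningfully below the threshold
    key = (user_id, asin)
    now = time.time()
    if now - last_fired.get(key, 0.0) < DEBOUNCE_SECONDS:
        return False                      # already alerted within the window
    last_fired[key] = now
    return True
```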
Batch processing further improves experience by combining multiple alerts into digest emails rather than separate messages. The alerting system demonstrates how real-world systems balance speed, reliability, and user experience. All components must scale together gracefully.
Scaling and performance optimization
Scaling sits at the heart of any system designed to handle millions of products and users. The architecture must support traffic spikes during major sales events while remaining cost-efficient during normal operations. Horizontal scaling provides the foundation, adding server capacity linearly rather than upgrading individual machines to more powerful configurations. Each layer of the system scales independently based on its specific resource requirements, from scrapers through APIs to databases. This creates a microservices architecture where bottlenecks can be addressed surgically.
Load balancers distribute incoming traffic across multiple server instances using algorithms that account for server health and current load. Round-robin distribution works for stateless services where any server can handle any request. Consistent hashing helps route related requests to the same servers when beneficial for caching.
Health checks continuously monitor server availability, automatically removing unhealthy instances from rotation and returning them when they recover. During traffic spikes, auto-scaling policies spin up additional instances within minutes. This requires services designed to handle dynamic cluster membership gracefully.
Real-world context: During Amazon Prime Day, price tracking sites see traffic increases of 5-10x normal levels as users research deals. Systems must scale to handle this surge while scrapers simultaneously increase their update frequency to capture rapidly changing prices. This creates compound load that tests every architectural assumption.
Caching delivers the single biggest performance improvement for most read-heavy web applications. Application caches using Redis store frequently accessed data like popular product histories and current prices, with cache keys incorporating product identifiers and time ranges. Time-to-live values set according to how frequently the underlying data changes balance freshness against load reduction; for price data that updates every few hours, TTLs of 15-30 minutes provide significant benefit.
Content delivery networks extend caching to the network edge, storing static assets at locations geographically close to users. Cache invalidation deserves careful attention since stale data creates user-visible problems. Event-driven invalidation where price updates trigger cache purges provides more precise control than time-based expiration alone.
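A sketch of event-driven invalidation that purges the keys used in the earlier cache-aside example whenever a product's price changes; the key naming convention is an assumption carried over from that example:

```python
import redis

cache = redis.Redis(host="localhost", port=6379)

def on_price_update(asin: str) -> None:
    """Purge cached entries for a product the moment its price changes."""
    # scan_iter avoids blocking Redis the way KEYS would on a large keyspace.
    for key in cache.scan_iter(match=f"history:{asin}:*"):
        cache.delete(key)
    cache.delete(f"product:{asin}")
```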
Database scaling requires different strategies depending on whether read or write performance limits the system. Read replicas offload query traffic from the primary database, with application logic routing read queries to replicas while writes go to the primary. For price tracking where most queries involve reading historical data, read replicas prove highly effective despite slight replication lag.
Write scaling through sharding distributes data across multiple database nodes based on a partition key. Product-based sharding assigns each product’s history to a specific shard based on a hash of its identifier. Cross-shard queries become more complex, but price tracking queries typically access single products at a time, making this partitioning strategy natural. Scaling enables handling increased load, but systems must also function when individual components fail.
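A sketch of hash-based shard routing; the connection strings are hypothetical, and a simple modulo scheme like this reshuffles data when the shard count changes, which is exactly the problem consistent hashing addresses:

```python
import hashlib

SHARD_DSNS = [                      # hypothetical connection strings, one per shard
    "postgresql://shard0.internal/prices",
    "postgresql://shard1.internal/prices",
    "postgresql://shard2.internal/prices",
    "postgresql://shard3.internal/prices",
]

def shard_for(asin: str) -> str:
    """Route every observation for a product to the same shard."""
    digest = hashlib.sha256(asin.encode("utf-8")).digest()
    index = int.from_bytes(digest[:4], "big") % len(SHARD_DSNS)
    return SHARD_DSNS[index]

print(shard_for("B000EXAMPLE"))     # always the same shard for this product
```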
Fault tolerance and system reliability
A reliable price tracking system recovers from failures automatically, maintaining service continuity even when individual components fail. Every large-scale system experiences failures ranging from network partitions to hardware problems to software bugs. The architecture must anticipate these events rather than assume it can prevent them. Designing for failure means building redundancy at every layer and implementing recovery mechanisms that operate without human intervention. This includes circuit breaker patterns that prevent cascading failures.
Redundancy forms the first line of defense against failures. Multiple scraper instances ensure data collection continues even when individual scrapers fail. Replicated databases maintain data availability when primary nodes become unavailable. Backup notification services take over when primary channels experience problems.
Each critical component runs in multiple instances across different availability zones, ensuring that localized failures do not impact overall system availability. This distributed approach increases complexity but provides the resilience that users expect from always-on services.
Pro tip: Test failure scenarios regularly using chaos engineering practices. Tools like Netflix’s Chaos Monkey randomly terminate instances in production, validating that redundancy mechanisms actually work under realistic conditions rather than just in theory. Scheduled game days that simulate outages build operational muscle memory.
Network requests fail for many transient reasons including temporary overload, network congestion, or brief service interruptions. Retry policies with exponential backoff automatically repeat failed requests with increasing delays, preventing retry storms that could overwhelm recovering services. A request might retry after 1 second, then 2 seconds, then 4 seconds, giving downstream services time to recover.
Circuit breakers prevent cascading failures by stopping requests to unhealthy services entirely. When a downstream service fails repeatedly, the circuit breaker opens and subsequent requests fail immediately rather than waiting for timeouts. This gives failing services time to recover without continued load.
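A minimal circuit-breaker sketch using only the standard library; the failure threshold and reset delay are illustrative:

```python
import time

class CircuitBreaker:
    """Open after `failure_threshold` consecutive failures; retry after `reset_seconds`."""

    def __init__(self, failure_threshold: int = 5, reset_seconds: float = 30.0):
        self.failure_threshold = failure_threshold
        self.reset_seconds = reset_seconds
        self.failures = 0
        self.opened_at: float | None = None

    def call(self, fn, *args, **kwargs):
        if self.opened_at is not None:
            if time.monotonic() - self.opened_at < self.reset_seconds:
                raise RuntimeError("circuit open: failing fast")   # skip the doomed call
            self.opened_at = None                                   # half-open: allow one probe
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.failure_threshold:
                self.opened_at = time.monotonic()
            raise
        self.failures = 0
        return result
```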
Graceful degradation ensures users access core functionality during partial outages. When the database becomes slow or unavailable, cached data enables read operations to continue for previously accessed products. When notification services fail, alerts queue for later delivery rather than being lost. When scrapers fall behind during traffic spikes, the system serves slightly stale prices rather than refusing all requests.
Each degradation path requires explicit design and testing to ensure it works when needed. Observability through monitoring tools like Prometheus and Grafana provides visibility into system health. Distributed tracing with tools like Jaeger helps diagnose problems across the microservices architecture. Disaster recovery planning extends reliability to site-wide events through regular backups, infrastructure-as-code, and documented recovery procedures. Security and privacy considerations form the final architectural layer.
Security and privacy considerations
Data trust matters as much as system reliability since price tracking involves user accounts, email addresses, and browsing behavior that users expect to remain private. Security must integrate into every architectural layer rather than being added as an afterthought. Authentication using OAuth2 or JWT tokens provides secure session management without storing sensitive credentials on the server side. Role-based access control separates administrator functions from regular user capabilities, limiting the blast radius of compromised accounts.
Data protection involves encryption at multiple levels. All communication uses TLS encryption preventing interception of data in transit. Sensitive data at rest including email addresses and authentication tokens receives AES-256 encryption in storage. Key rotation policies regularly update encryption keys, limiting exposure if keys become compromised.
API security includes input validation and sanitization to prevent injection attacks. It also includes strict CORS policies controlling which domains can make API requests and rate limiting to prevent abuse or denial-of-service attacks.
Watch out: Price tracking systems aggregate user interest data that reveals purchasing intent, which has commercial value and attracts attention from malicious actors. Treat alert configurations with the same security consideration as financial data, implementing audit logging and access controls for any queries against user preferences.
Privacy compliance extends beyond security to how data gets collected, used, and retained. Users must consent to email alerts explicitly with easy opt-out mechanisms for those who change their minds. Activity logging for analytics must anonymize user identification to prevent building trackable profiles. Data retention policies define how long historical information persists with automatic deletion beyond retention periods.
Regional privacy regulations like GDPR impose additional requirements for users in those jurisdictions, potentially including data portability and deletion on request. Understanding Amazon Buy Box dynamics and how multiple sellers compete for featured placement adds another dimension to price tracking that advanced systems must consider.
Conclusion
Building a price tracking system like CamelCamelCamel illuminates fundamental principles of distributed System Design that extend far beyond this specific application. The architecture demonstrates how to handle continuous background workloads that operate independently of user traffic through event-driven processing. It shows how to choose and optimize time-series databases for specific access patterns like append-heavy writes and range queries. It also shows how to build alerting pipelines that balance speed with reliability through message queues and circuit breakers.
The careful attention to debounce logic, cache invalidation, and graceful degradation reflects the maturity required for production systems serving millions of users.
These patterns apply directly to other domains involving continuous data collection, historical analysis, and threshold-based notifications. Stock price monitors, infrastructure alerting systems, and IoT sensor platforms all share architectural similarities with price tracking. The specific technologies evolve with new time-series databases and streaming platforms emerging regularly. However, the underlying principles of decoupled ingestion through message queues, efficient storage tiering across hot and cold layers, and reliable event processing through retry policies and dead-letter queues remain constant across generations of systems.
The future of price tracking points toward increased automation and broader scope. Machine learning models could predict optimal buying times rather than just reporting historical trends. Multi-retailer tracking would compare prices across Amazon, Walmart, and other platforms simultaneously. Real-time browser extension integration could surface price history during shopping sessions without requiring separate site visits. Whatever direction the technology evolves, mastering the foundational patterns explored here provides the architectural vocabulary to design, discuss, and build the next generation of data-intensive applications.