By JME | Universe-J
Overview
HooperData is my personal PostgreSQL (pSQL) database that serves as the central data hub for all Universe-J basketball-related web applications. It routinely processes and stores basketball data at intervals that depend on the type of data being collected.
By keeping an up-to-date, structured, and centralized data source, HooperData eliminates the need for repetitive web scraping whenever a user interacts with one of the web apps. This ensures faster, more efficient data retrieval while maintaining accuracy and historical tracking.
Key Features
- Structured PostgreSQL Database – Stores cleaned and formatted basketball data in an optimized structure for easy access.
- Seamless Web App Integration – Supplies up-to-date basketball data to multiple web applications.
- Automated Logging & Backups – Uses Loguru for logging and AWS S3 for CSV backups with JSON metadata.
- Custom VPS Hosting – Runs on a dedicated VPS setup, ensuring reliability and control over the database environment.
How It Works
1. Data Refresh Intervals
Different types of basketball data are pulled at optimized intervals, for example:
- Current Regular Season Stats – Once daily.
- Awards (Monthly & Postseason) – Monthly for regular season awards and daily during April–July for postseason data.
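As a rough illustration of the intervals above, here is a minimal Python sketch of a "is this dataset due today?" check. The dataset names, the `is_due` function, and the choice to run monthly awards on the first day of the month are assumptions made for the example, not the actual HooperData scheduler.

```python
from datetime import date

# April through July, taken from the postseason window listed above.
POSTSEASON_MONTHS = {4, 5, 6, 7}


def is_due(dataset: str, today: date) -> bool:
    """Return True if `dataset` should be refreshed on `today`.

    Dataset names are hypothetical labels for this sketch.
    """
    if dataset == "regular_season_stats":
        return True  # pulled once daily
    if dataset == "monthly_awards":
        # Assumption: the monthly pull runs on the first day of each month.
        return today.day == 1
    if dataset == "postseason_awards":
        # Daily, but only inside the April-July window.
        return today.month in POSTSEASON_MONTHS
    raise ValueError(f"unknown dataset: {dataset}")
```

A daily job could loop over the dataset names and pull only those for which `is_due` returns True.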
2. Data Processing & Storage
- Once pulled, the data is cleaned, formatted, and structured before storage.
- Data is stored in PostgreSQL tables optimized for easy querying and integration.
- The structure ensures that the data can be easily accessed and implemented across different web applications.
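The clean-then-store step could look roughly like the sketch below. The table name `player_season_stats`, its columns, and the `clean_row` helper are hypothetical, chosen only to illustrate the idea; the real schema may differ. The SQL uses PostgreSQL's `INSERT ... ON CONFLICT ... DO UPDATE` so that a repeated daily pull updates existing rows instead of duplicating them, and it assumes a unique constraint on `(player, season)`.

```python
def clean_row(raw: dict) -> dict:
    """Normalize one scraped row: trim text, cast numbers, blank -> None."""

    def to_float(value):
        value = (value or "").strip()
        return float(value) if value else None

    return {
        "player": raw["player"].strip(),
        "season": raw["season"].strip(),
        "points_per_game": to_float(raw.get("ppg")),
    }


# Hypothetical table and columns. Named placeholders in the %(name)s style
# are what a PostgreSQL driver such as psycopg expects.
UPSERT_SQL = """
INSERT INTO player_season_stats (player, season, points_per_game)
VALUES (%(player)s, %(season)s, %(points_per_game)s)
ON CONFLICT (player, season)
DO UPDATE SET points_per_game = EXCLUDED.points_per_game;
"""
```

With a driver such as psycopg, each cleaned row would be passed as the parameter dictionary, for example `cursor.execute(UPSERT_SQL, clean_row(raw))`.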
3. Logging & Backups