HooperData | Documentation

Overview

HooperData is my personal PostgreSQL (pSQL) database that serves as the central data hub for all Universe-J basketball-related web applications. It routinely processes, and stores basketball data at different intervals based on the type of data being collected.

By keeping an up-to-date, structured, and centralized data source, HooperData eliminates the need for repetitive web scraping whenever a user interacts with one of the web apps. This ensures faster, more efficient data retrieval while maintaining accuracy and historical tracking.

Key Features

Structured PostgreSQL Database – Stores cleaned and formatted basketball data in an optimized structure for easy access.
Seamless Web App Integration – Supplies real-time basketball data to multiple web applications.
Automated Logging & Backups – Uses Loguru for logging and AWS S3 for CSV backups with JSON metadata.
Custom VPS Hosting – Runs on a dedicated VPS setup, ensuring reliability and control over the database environment.

How It Works

1. Data Refresh Intervals

Different types of basketball data are pulled at optimized intervals, for example:

Current Regular Season Stats – Once daily.
Awards (Monthly & Postseason) – Monthly for regular season awards and daily during April–July for postseason data.

2. Data Processing & Storage

Once populated, the data is cleaned, formatted, and structured before storage.
Data is stored in PostgreSQL tables optimized for easy querying and integration.
The structure ensures that the data can be easily accessed and implemented across different web applications.

Overview

Key Features

How It Works

1. Data Refresh Intervals

2. Data Processing & Storage

3. Logging & Backups