From e4d1c2848601d792ce16eee50222abeb23b77c11 Mon Sep 17 00:00:00 2001 From: Pinapelz Date: Thu, 28 May 2026 11:44:28 -0700 Subject: init commit --- README.md | 11 +++++++++++ 1 file changed, 11 insertions(+) create mode 100644 README.md (limited to 'README.md') diff --git a/README.md b/README.md new file mode 100644 index 0000000..486bdb0 --- /dev/null +++ b/README.md @@ -0,0 +1,11 @@ +Scraper to collect and resize kpop images for a captcha. Made for 4get + +1. Downloads images accoding to groups specified in `PULL_GROUPS` in `groups.py` +2. Renames images to numberd `[x].png` +3. Uses insightface's `buffalo_1` face detection to crop a 100x100 area around faces in downloaded image + +Configure `groups.py` accordingly before running +``` +uv sync +uv scrape_data.py +``` -- cgit v1.2.3