| author | Pinapelz <yukais@pinapelz.com> | 2026-05-28 11:44:28 -0700 |
|---|---|---|
| committer | Pinapelz <yukais@pinapelz.com> | 2026-05-28 11:44:28 -0700 |
| commit | e4d1c2848601d792ce16eee50222abeb23b77c11 (patch) | |
| tree | 4392f43c252c6686bf28b86877f184f2d480fe41 /README.md | |
Diffstat (limited to 'README.md')
| -rw-r--r-- | README.md | 11 |
1 files changed, 11 insertions, 0 deletions
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..486bdb0
--- /dev/null
+++ b/README.md
@@ -0,0 +1,11 @@
+Scraper to collect and resize kpop images for a captcha. Made for 4get
+
+1. Downloads images accoding to groups specified in `PULL_GROUPS` in `groups.py`
+2. Renames images to numberd `[x].png`
+3. Uses insightface's `buffalo_1` face detection to crop a 100x100 area around faces in downloaded image
+
+Configure `groups.py` accordingly before running
+```
+uv sync
+uv scrape_data.py
+```
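Step 3 of the README (cropping a 100x100 area around a detected face) can be illustrated with a small sketch. This is not the code from `scrape_data.py`, which this commit does not show. The function name `crop_box`, its parameters, and the clamping behaviour are assumptions made for illustration. It assumes the detector returns a bounding box as `(x1, y1, x2, y2)` in pixels, and that the crop window is centred on the box and shifted inward when the face sits near an image edge.

```python
def crop_box(bbox, img_w, img_h, size=100):
    """Return (left, top, right, bottom) of a size x size crop window.

    Hypothetical helper: the window is centred on the face bounding box
    and shifted inward so that it never leaves the image.
    """
    if img_w < size or img_h < size:
        # An image smaller than the window cannot hold a full crop.
        raise ValueError("image is smaller than the requested crop size")

    x1, y1, x2, y2 = bbox
    # Centre of the detected face.
    cx = (x1 + x2) / 2
    cy = (y1 + y2) / 2

    # Top-left corner of a window centred on the face.
    left = int(round(cx - size / 2))
    top = int(round(cy - size / 2))

    # Clamp so the whole window stays inside the image bounds.
    left = max(0, min(left, img_w - size))
    top = max(0, min(top, img_h - size))

    return left, top, left + size, top + size


if __name__ == "__main__":
    # Face in the middle of a 300x300 image.
    print(crop_box((90, 90, 110, 110), 300, 300))
```

The returned tuple uses the `(left, top, right, bottom)` order that Pillow's `Image.crop` accepts. With insightface, the bounding box would typically come from the `bbox` attribute of each face returned by `FaceAnalysis.get`. Note that insightface's published model pack is named `buffalo_l` (lowercase L), which is presumably what the README's `buffalo_1` refers to. A face wider than 100 pixels would be cut off by this window; whether the original script rescales first is not stated in the README.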
