The original post: /r/datahoarder by /u/comradesean on 2025-02-15 17:11:20.
I'm working on restoring an item unlocker for a video game that relied on an API from 2018, which is no longer active. This process included an HTTP request to a news article that no longer exists and wasn't archived. The good news is that I can take an article from that time period and modify it (which I've already done). However, the JavaScript is broken, and after spending the last week debugging minified and obfuscated JavaScript, I've made no progress.
I'm not familiar enough with HTTrack or other methods for capturing web pages, and no matter what I try, it always seems to break the JavaScript in some way. If anyone has any tips or tricks for capturing a single page with all the necessary scripts intact, your help would be greatly appreciated.
The page I've been trying to use as my base
https://web.archive.org/web/20180414013843fw_/https://blog.twitch.tv/overwatch-league-all-access-pass-on-twitch-8cbf3e23df0a?gi=4debdce8040a
and the httrack that fails me
httrack "https://web.archive.org/web/20180414013843fw_/https://blog.twitch.tv/overwatch-league-all-access-pass-on-twitch-8cbf3e23df0a?gi=4debdce8040a" -O "E:\My Web Sites" "+*.*" "+*.css" "+*.js" "+*.html" "+*.png" "+*.jpg" "+*.gif" "+*.woff" "+*.woff2" "+*.ttf" "+*.svg" --mirror --keep-alive -r99 --max-rate=1000000 --assume "https://web.archive.org/web/20180414013843fw_/https://blog.twitch.tv/overwatch-league-all-access-pass-on-twitch-8cbf3e23df0a?gi=4debdce8040a" --robots=0 --referer "https://web.archive.org/" -%Pt
If anyone has any questions about the whole thing feel free to ask. The rest of the application is essentially done as I've mapped out the memory addresses in the application and recreated an barebones and stubby api just to emulate the process. This is the last piece needed.