>>22532
I'm using an existing library for the parsing, but the library doesn't have \uXXXX support. They fixed that later, but I can't get that patch on my NPL system, and at work we don't have a 7.5x+ system yet (the patch is for 7.5x+ systems only). The NPL system is 7.52, but it counts as a demo, and it even has some weird crap in the SQL database so that you can't grow it forever. There is a trick to remove that restriction though (which I did).
The good thing about SAP is that almost everything is basically open source, so you can check what they are doing (and in normal systems even modify it when there is no other way).
The NPL system is also running under Linux (SAP systems are available for Windows as well as other operating systems).
My code seems to be working properly; I even checked what 8kun does when JSON-related characters are in a post. Those are not replaced with \uXXXX, but with escaped characters, for example \", \\ etc. If these characters came in as \uXXXX, I would still replace them with the escaped characters, so that the library doesn't have a problem with them.
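A minimal sketch of that replacement step, written in Python purely for illustration (the real code runs on the SAP system and is not shown here). The names `replace_unicode_escapes`, `SHORT_ESCAPES` and `ESCAPE` are hypothetical, and the handling of control characters and lone surrogates is an assumption, not something confirmed about the original code.

```python
import re

# Characters that must stay escaped inside a JSON string, with their short form.
SHORT_ESCAPES = {
    '"': '\\"',
    '\\': '\\\\',
    '\b': '\\b',
    '\f': '\\f',
    '\n': '\\n',
    '\r': '\\r',
    '\t': '\\t',
}

# Matches every backslash escape so that an escaped backslash followed by
# "u0041" is not mistaken for a \uXXXX sequence.
# Groups 1 and 2: a surrogate pair; group 3: a single \uXXXX.
ESCAPE = re.compile(
    r'\\(?:u([dD][89abAB][0-9a-fA-F]{2})\\u([dD][c-fC-F][0-9a-fA-F]{2})'
    r'|u([0-9a-fA-F]{4})'
    r'|.)',
    re.DOTALL,
)


def replace_unicode_escapes(raw):
    """Rewrite \\uXXXX escapes in raw JSON text for a parser that lacks them."""

    def fix(match):
        high, low, single = match.groups()
        if high is not None:
            # Combine a UTF-16 surrogate pair into one code point.
            code = (0x10000
                    + ((int(high, 16) - 0xD800) << 10)
                    + (int(low, 16) - 0xDC00))
            return chr(code)
        if single is None:
            # Already a short escape such as \" or \\ : keep it as it is.
            return match.group(0)
        code = int(single, 16)
        char = chr(code)
        if char in SHORT_ESCAPES:
            # JSON-related character: emit the short escape, not the raw char.
            return SHORT_ESCAPES[char]
        if code < 0x20 or 0xD800 <= code <= 0xDFFF:
            # Assumption: other control characters and lone surrogates have
            # no safe replacement, so the escape is left untouched.
            return match.group(0)
        return char

    return ESCAPE.sub(fix, raw)
```

The point of the lookup table is that a decoded quote or backslash would break the surrounding JSON string if it were inserted raw, so those go back in as \" and \\ while everything else becomes the plain character.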
If my bandwidth were big enough, I could download all the videos and upload them somewhere else. Another idea would be to upload them to anonfiles.com. As far as I can remember they don't delete files unless they really violate the law.
Agreed about PDFs. I really don't understand why 8kun accepts PDFs but rejects plain text files at the same time. Plain text is not dangerous at all.
I archive the PDFs on the Wayback Machine as well. Sadly archive.IS doesn't support PDF archiving, otherwise I would do that too.
All threads are archived there though, so if you offer links to the original threads, you can also offer links to the archived threads on the Wayback Machine as well as archive.IS. Typically the archive.IS copies are better; I have never seen pictures missing there. On the Wayback Machine that happens sometimes.