Claude Opus came up with this script:
It produces a somewhat-readable PDF (first page at least) with this text output:
(I used the cleaned output at https://pastebin.com/UXRAJdKJ mentioned in a comment by Joe on the blog page)
Any chance you could share a screenshot / re-export it as a (normalized) PDF? I’m curious about what’s in there, but all of my readers refuse to open it.
https://www.mountsinai.org/about/newsroom/2012/dubin-breast-...
https://www.businessinsider.com/dubin-breast-center-benefit-...
Even names match up, but oddly the date is different.
Or worse. She did.
I've known from the second I started doing debate and FX/DX in highschool, well, let's just say I never thought that the majority of the 2FA-folks would be worth a damn when tyranny really came knocking. Fear of the other as a form of manipulation, and a distraction from class consciousness, has been their literal raison d'état since decades before I was born.
I guess I was shocked that the President being a convicted rapist and documented child predator would be a bridge too far. But then we re-elected him.
I believe it. We voted for this. We do nothing in the face of zero actual justice. This is exactly as good as we deserve. And best of all, it certainly doesn't stop here. This is what they chose to not redact. When we know they spent enormous tax-payer hundreds-of-people hours redacting the documents.
I don't think it's even conspiratorial to say they left stuff in, so they could use it as justification for not releasing the other HALF of the files that haven't been released, even overly censored.
We deserve this, and the much worse that our apathy has invited.
I’d never believe Bill Gates would secretly slip antibiotics into his wife’s cocktail to treat an STI he got from a Russian prostitute on convicted pedophile estate.
But here we are.
Unfortunately no, it just seems to be greed, incompetence, and incompetent greed. At least when a tank drives over a protestor somebody gets to be on the side of the tank. When the bus goes off a cliff because the driver sold the steering wheel everybody dies.
the mascot of 4chan was literally pedobear, what time frame are you referring to?
More likely it's just an oversight, but it could also be CYA for dragging their feet, like "you rushed us, and look at these victims you've retraumatized". There are software solutions to find nudity and they're quite effective.
There's redaction to protect victims and there's redaction to protect specific co-conspirators in Epstein's spy ring
The challenge, as we're all experiencing together, is that the law is not inherently self-enforcing.
https://www.govinfo.gov/content/pkg/PLAW-119publ38/pdf/PLAW-... : the Attorney General was to have produced the entirety of the Epstein files, with very narrowly-enumerated redactions, in December. She has not done so.
Furthermore, there are numerous allegations that the documents that have been released contain CSAM, which (referencing the PDF above) may fall afoul of 18 U.S.C. 2252–2252A.
In addition, one need only glance at the action in US courts to see egregious violations of the Constitution and valid court orders playing out daily.
https://www.documentcloud.org/documents/26513988-trorder0128...
https://storage.courtlistener.com/recap/gov.uscourts.mnd.230...
(It's also worth noting that almost none of the government's appeals to their losses in preliminary injunctions have been on the merits as to whether or not their actions were legal, but rather on the grounds of "no one should be allowed to challenge our actions," which has also been a fairly losing argument for everybody except SCOTUS.)
yes.... any administration can be found guilty of violating law, and should be dealt with accordingly.
https://www.cbsnews.com/minnesota/news/ice-violations-judge-...
> ICE has likely violated more court orders in January 2026 than some federal agencies have violated in their entire existence," Schiltz said, adding that he counted 96 court orders that ICE has violated in 74 cases.
https://www.cbsnews.com/news/frustrations-from-judge-prosecu...
https://www.politico.com/news/2026/01/27/patrick-schiltz-jud...
https://storage.courtlistener.com/recap/gov.uscourts.mnd.230...
https://storage.courtlistener.com/recap/gov.uscourts.mnd.230...
Did you notice that one article I linked involved a DoJ lawyer admitting that she couldn't convince ICE to obey court orders that she was trying to transmit to them? That's beyond an allegation and into admission. How is that not evidence?
More on these ignored court orders:
https://www.mprnews.org/story/2026/01/28/ice-illegally-detai...
They illegally withheld funds (impoundment) from congressionally authorized/mandated expenditures and relied on pocket rescissions to defund programs they didn't like: https://www.cbpp.org/research/federal-budget/pocket-rescissi...
They keep illegally appointing unqualified hacks as US attorney in defiance of the mandate they're approved by the Senate (Essayli, Habba, Halligan, Sarcone, Chattah) - judges have found at least five of the appointments illegal. As one example: https://www.politico.com/news/2025/10/28/judge-los-angeles-t...
They've repeatedly violated court orders to either return immigrant detainees or release them. "This is one of dozens of court orders with which respondents have failed to comply in recent weeks.": https://www.cnn.com/2026/01/27/politics/patrick-schiltz-judg...
The EPA illegally convened a secret panel of climate deniers to issue a sham report in order to repeal the endangerment finding: https://www.nytimes.com/2026/01/30/climate/energy-department...
His targeting and shakedowns of Universities, law firms, and media companies is transparently illegal jawboning.
Everything about the tariffs is obviously illegal which he confirms every time he opens his mouth since he's relying on 'national security' justifications to issue them without Congress and he keeps insisting they're punishment for some random perceived slight.
His illegal firing of Federal workers without the notice required: https://www.npr.org/2025/09/25/nx-s1-5544317/federal-probati...
Some sillier things like renaming the Kennedy Center -- the law that established it literally said that it couldn't be renamed without Congress -- so Trump firing everyone on the board and then appointing a bunch of his flunkees to vote for the name change doesn't cut it.. https://beatty.house.gov/sites/evo-subsites/beatty.house.gov...
It's a literal onslaught of illegality so I can't tell if you haven't read a news article since 2025 or if you're trolling.
The legal situation regarding CSAM is very strict no matter which country, and I better hope no one here will actually be dumb enough to provide actual links.
1. Get an open source pdf decoder
2. Decode bytes up to first ambiguous char
3. See if next bits are valid with an 1, if not it’s an l
4. Might need to backtrack if both 1 and l were valid
By being able to quickly try each char in the middle of the decoding process you cut out the start time. This makes it feasible to test all permutations automatically and linearly
Also look up double/triple data-entry systems, where you have multiple people enter the data and then flag and resolve differences. Won't protect you from your staff banding together to fuck you over with maliciously bad data, but it's incredibly effective to ensure people were Actually Working Their Blocks under healthy circumstances.
I consider myself fairly normal in this regard, but I don't have 76 friends to ask to do this, so I don't know how I'd go about doing this. Post an ad on craigslist? Fiverr? Seems like a lot to manage.
The copy linked in the post:
https://www.justice.gov/epstein/files/DataSet%209/EFTA004004...
Three more copies:
https://www.justice.gov/epstein/files/DataSet%2010/EFTA02153...
https://www.justice.gov/epstein/files/DataSet%2010/EFTA02154...
https://www.justice.gov/epstein/files/DataSet%2010/EFTA02154...
Perhaps having several different versions might make it easier.
https://www.justice.gov/epstein/files/DataSet%209/EFTA007755...
This doesn't solve the "1 & l" problem for the pdf you are looking at, but it could be useful anyway.
https://www.justice.gov/epstein/files/DataSet%2011/EFTA02702...
Unlike every other PDF format that has been attempted, the federal government doesn't have to worry about adoption.
It’s not a tools problem, it’s a problem of malicious compliance and contempt for the law.
For example, when the Mueller reports were released with redactions, they had no searchable text or meta data because they were worried about these exact kind of data leaks.
However, vast troves of unsearchable text is not a huge win for transparency.
PDFs are just a garbage format and even good administrations struggle.
Hmm. Anyone got some spare CPU time?
I tried to find the message in this blog post, but couldn't. (don't see how to search by date).
Followup: pdfimages is 13x faster than pdftoppm
PDF is basically a prettify layer on top of the older PS that brings an all lot of baggage. The moment you start trying to do what should be simple stuff like editing lines, merging pages, change resolution of the images, it starts giving you a lot of headaches.
I used to have a few scripts around to fight some of its quirks from when I was writing my thesis and had to work daily with it. But well, it was still an improvement over Word.
https://www.justice.gov/epstein/files/DataSet%2010/EFTA01804...
https://www.justice.gov/epstein/files/DataSet%209/EFTA007755...
https://www.justice.gov/epstein/files/DataSet%209/EFTA004349...
and than this one judging by the name of the file (hanna something) and content of the email:
"Here is my girl, sweet sparkling Hanna=E2=80=A6! I am sure she is on Skype "
maybe more sinister (so be careful, i have no ideas what the laws are if you uncover you know what trump and Epstein were into)...
https://www.justice.gov/epstein/files/DataSet%2011/EFTA02715...
[Above is probably a legit modeling CV for HANNA BOUVENG, based on, https://www.justice.gov/epstein/files/DataSet%209/EFTA011204..., but still creepy, and doesn't seem like there's evidence of her being a victim]
It's really really hard to give them the benefit of the doubt at this point.
Incompetence is incompetence.
A dynamic programming type approach might still be helpful. One version or other of the character might produce invalid flate data while the other is valid, or might give an implausible result.
Cool article, however.
The recipient is also named in there...
The search on the DOJ website (which we shouldn't trust), given the query: "Content-Type: application/pdf; name=", yields maybe a half dozen or so similarly printed BASE64 attachments.
There's probably lots of images as well attached in the same way (probably mostly junk). I deleted all my archived copies recently once I learned about how not-quite-redacted they were. I will leave that exercise to someone else.
Of course there are other content-types, e.g. searching for "Content-Type: image/jpeg" gets hits as well. But only a few of them actually have the base64 data, mostly there are just the MIME headers.. Looking for "/9j/" (which is Base64 for FF D8 FF, which is the header for JPEG files), the Trumpian justice.gov website ignores "/" and shows results case-insensitively, but there are 4 or 5 base64'ed JPEG images in there.
I also saw that the page is vulnerable to code injection, somehow garbage in one search result preview was OCREd as "<s [lots of garbage]>", and the rest of the search results were striken-through because "<s>" is the HTML to do that.