Just like here you could get a timeline of key events, a graph of connected entities, links to original documents.
Newsrooms might already do this internally idk.
This code might work as a foundation. I love that it's RDF.
Tech Company: At long last, we have created the Torment Nexus from classic sci-fi novel Don't Create The Torment Nexus
https://developer.nytimes.com/docs/semantic-api-product/1/ov...
The Guardian has similar:
https://open-platform.theguardian.com/documentation/tag
Either or both could be an interesting starting point for something like that. I tried to find something for the BBC and was surprised they didn’t have anything. I would have figured public media would have been a great resource for this.
That said, some networks of shorter paths than 6 are interesting. Right now, there's a 1:1 direct path from these documents to a bunch of people with an interest in confounding what evidentiary value they have in justice processes. That's more interesting to me, than what the documents say right now.
These general data models start to become useful and interesting at around a trillion edges, give or take an order of magnitude. A mature graph model would be at least a few orders of magnitude larger, even if you aggressively curated what went into it. This is a simple consequence of the cardinality of the different kinds of entities that are included in most useful models.
No system described in open source can get anywhere close to even the base case of a trillion edges. They will suffer serious scaling and performance issues long before they get to that point. It is a famously non-trivial computer science problem and much of the serious R&D was not done in public historically.
This is why you only see toy or narrowly focused graph data models instead of a giant graph of All The Things. It would be cool to have something like this but that entails some hardcore deep tech R&D.
To plug my project, I've wrapped the SuiteSparse GraphBLAS library in a postgres extension [1] that fluidly blends algebraic graph theory with the relational model, the main flow is to use sql to structure complex queries for starting points, and then use the graphblas to flow through the graph to the endpoints, then joining back to tables to get the relevant metadata. On cheap hetzner hardware (amd epyc 64 core) we've achieved 7 billion edges per second BFS over the largest graphs in the suitesparse collection (~10B edges). With our cuda support we hope to push that kind of performance into graphs with trillions of edges.
Using this press release as an example, if you pay attention to the details you'll notice that this graph has an anomalously low degree. That is, the graph is very weakly connected, lots of nodes and barely any edges. Typical graph data models have much higher connectivity than this. For example, the classic Graph500 benchmark uses an average degree of 16 to measure scale-out performance.
So why did they nerf the graph connectivity? One of the most fundamental challenges in scaling graphs is optimally cutting them into shards. Unlike most data models, no matter how you cut up the graph some edges will always span multiple shards, which becomes a nasty consistency problem in scale-out systems. Scaling this becomes exponentially harder the more highly connected the graph. So basically, they defined away the problem that makes graphs difficult to scale. They used a graph so weakly connected that they could kinda sorta make it work on a thousand(!) machines even though it is not representative of most real-world graph data models.
That is a wild claim. Perhaps for some very specific definition of "useful and interesting"? This dataset is already interesting (hard to say whether it's useful) at a much tinier scale.
Almost every non-trivial graph data model about the world is a graph of human relationships in the population. If not directly then by proxy. Population scale human relationship graphs commonly pencil out at roughly 1T edges, a function of the population size. It is also typically the highest cardinality entity. Even the purpose isn’t a human relationship graph, they all tend to have one tacitly embedded with the scale implied.
If you restrict the set of human entities, you either end up with big holes in the graph or it is a graph that is not generally interesting (like one limited to company employees).
The OP was talking about generalizing this to a graph of people, places, events, and organizations, which always has this property.
It is similar to the phenomenon that a vast number of seemingly unrelated statistics are almost perfectly correlated with GDP.
Americans..?
I didn't go down the route of LLMs for the clean up, as you're getting into scale and context issues with larger datasets.
I got into semantic similarity networks for this use case. You can do efficient pairwise matching with Annoy, set a cutoff threshold, and your isolated subgraphs are merger candidates.
I wrapped up my code in a little library if you're into this sort of thing.
github.com/specialprocedures/semnet
[1] - https://www.youtube.com/watch?v=XcebHfZ2LbU [video][4 mins]
https://github.com/maxandrews/Epstein-doc-explorer/blob/83ee...
The Bill Clinton entity is interesting.
> 2009: Bill Clinton discontinued association with Jeffrey Epstein
> 2010: Jeffrey Epstein provided flights on jets to Bill Clinton
> 2010-2011: Jeffrey Epstein traveled via private aircraft with Bill Clinton
> 2011: Ghislaine Maxwell piloted helicopter for Bill Clinton
> 2014: Bill Clinton alleged presence at sex parties
> 2015: Bill Clinton distanced relationship from Jeffrey Epstein
Wasn't very good at discontinuing the relationship it seems.
Guess there is precedent for him lying about sexual activities though.
I think a sentiment analysis between the friendliness and social meetups between Epstein and other individuals would be useful.
Who were his friends after 2008 when he was first convicted?
Those who were still friends with him after 2008 were in on it or guilty by association, if not legally, socially.
Friends like Reid Hoffman and Larry Summers...
> From: Reid Hoffman
> Sent: 7/6/2015 5:04:31 PM
> To: jeffrey E. [jeeyacation@gmail.com]
> Subject: RE: ICYMI
> slow progress.
> planning to see you in August.
> Hope you're well.
Larry Summers has too many to list. Doesn't look good though digging through them.
Keep in mind that those summaries are AI-generated. There's gonna be a lot of confabulating in there.
Care to dispute the summaries using the sources?
This is of course ontop of the 2006 Florida prostitution charge though.
But point being those people that were friends with him had to know. Whether it was socially acceptable by the elite because the public wasn't aware isn't very relevant.
Trump gave information against Epstein in 2009 and unlike Bill and others did cut ties after learning he was poaching girls from Mar-a-Lago.
I specifically made the point to look into those who were friends with Epstein even after knowing what he was doing.
Nice whataboutism though. Feel free to reference source materials to support your claims.
Btw are you a bot or is that just a canned statement you use?
But sure, lock all of them up, just don't ignore a few because they are too powerful.
I’m still convinced it’s a minority of loud voices online and on social media.
Point to an email in this dump or anything else.
It's clear as day Trump cut ties when he found out who he was and was against him.
Not so much for others.
Personally I've never been shocked that some of the most powerful people in the world like to go to a private sex-island where they could do as they pleased. That's precisely the incentive to becoming so incredibly powerful in the first place: to be able to pursue personal gain with increasingly less consequences.
Interesting attempted deflection away from Trump.
Pre-2009 records on Trump there are nasty. One example:
> ... It was at these series of parties that I personally witnessed the Plaintiff being forced to perform various sexual acts with Donald J. Trump and Mr. Epstein. Both Mr. Trump and Mr. Epstein were advised that she was 13 years old. I personally witnessed four sexual encounters that the Plaintiff was forced to have with Mr. Trump during this period, including the fourth of these encounters where Mr. Trump forcibly raped her despite her pleas to stop.
Only difference between Clinton and Trump is that Trump is still president.
Love how we have actual evidence against people but discussions always devolve into some conspiracy related to Trump.
----
Based on the available evidence, there is no confirmed meeting between Trump and Epstein in 2017. While both men were in Palm Beach during Thanksgiving week 2017, there is no direct evidence they met.
Here's what we know about their presence in Palm Beach that week:
- Trump was at Mar-a-Lago from November 21-26, 2017
- Epstein owned a mansion in Palm Beach and was known to be in the area
- Epstein mentioned both Trump and himself being "down there" (Palm Beach) in an email exchange on November 23, 2017
While there were claims circulating online that Trump spent Thanksgiving with Epstein in 2017, these claims have been thoroughly investigated and found to be unsubstantiated
Trump's official calendar for that week shows his activities included:
- Thanking military members on a virtual call
- Visiting Coast Guard members at Lake Worth Inlet Station
- Playing golf with Tiger Woods and Dustin Johnson
Damn, Trump would have 100% listed his sex crimes on his official calendar. Case closed.
No reason to talk about anyone who actually corresponded with Epstein I guess.
---
Epstein to Maxwell 2011-04-02
> i want you to realize that that dog that hasn’t barked is trump... [VICTIM] spent hours at my house with him ,, he has never once been mentioned. police chief. etc. im 75% there
---
Epstein to Ruemmler 2018-08-23
> you see, i know how dirty donald is. my guess is that non lawyers ny biz people have no idea. what it means to have your fixer flip
The most "damning" emails are hearsay from other people?
I'm saying there's no direct evidence he did and on face value it's ridiculous.
He was meeting the troops and golfing with Tiger Woods and happened to be in the same state Epstein had a house in.
Have any evidence otherwise, or just conspiracy theories?