Anthropic and OpenAI both have options that let you use their APIs without your data being used for training (not sure if the others do as well), so if t3chat is simply using the API, it may be that t3chat itself is collecting your inputs (or not, you’d have to check the TOS) while its backend model providers are not. Or, who knows, they could all be lying too.
The scenario you describe with ISPs is pretty US-centric, as are the various copyright laws and the companies backing them, which is (one of the reasons) why many of the most successful VPN companies are not based in the US (and most have server nodes outside the US too).
Mullvad is from Sweden, for example, and Proton is from Switzerland, so if a content company can even figure out which endpoint nodes are hosting/routing the pirate content, they then also have to (a) figure out who owns the node and (b) send them an angrygram, which will just immediately be torn up by the VPN provider since it’s not subject to US law.
Finally, an operating principle of these companies is to keep no logs, so even if a US-based VPN company got an angry letter, they’d probably be unable to do anything since they would have no record of the activity.
I think the only rule they had when “planning” Dallas was “there are no rules”. Zero zoning rules means one giant skyscraper in the middle of a mile of strip malls, multiple city “centers”, vast areas that are completely unwalkable due to lack of sidewalks and/or what are basically highways running through them, and no mass transit to speak of. It’s like they took 5 shitty, small cities and glued them together with more shitty city material.
Are you looking for a tool that can diff legal documents line by line or clause by clause? If the latter, I’d bet an LLM with a large context size could do a pretty good job, especially if you used a script (or another pass through the LLM) to break the documents down into like sections so that it could just compare e.g. all Controlling Law sections with each other and all IP Indemnification sections with each other.
Now that I think about it, by tuning the prompt (and keeping the temperature very low, like 0) you could probably get it to return everything from proper diffs to summaries of conceptual differences. And it could definitely do multiple documents at once if you were to break them into like pieces ahead of time.
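For the "script to break them down into like sections" part, here is a rough sketch of what I mean. It assumes headings look like "Section 3. Controlling Law", which is purely my guess at a format, so the regex would need adjusting for real contracts. The function names are made up for illustration, and it uses Python's difflib for a plain text diff rather than an LLM; the per-section pairs it lines up are what you would hand to the LLM if you wanted conceptual summaries instead.

```python
import difflib
import re

# Assumed heading format, e.g. "Section 3. Controlling Law" (hypothetical).
HEADING = re.compile(r"^Section\s+\d+\.\s+(.+)$", re.MULTILINE)


def split_sections(text):
    """Return {section title: body text} for one document."""
    sections = {}
    matches = list(HEADING.finditer(text))
    for i, m in enumerate(matches):
        # A section body runs from the end of its heading to the next heading.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        sections[m.group(1).strip()] = text[m.end():end].strip()
    return sections


def diff_like_sections(doc_a, doc_b):
    """Diff only sections that share a title; return {title: diff lines}."""
    a, b = split_sections(doc_a), split_sections(doc_b)
    result = {}
    for title in sorted(a.keys() & b.keys()):
        lines = list(difflib.unified_diff(
            a[title].splitlines(), b[title].splitlines(),
            fromfile="A: " + title, tofile="B: " + title, lineterm=""))
        if lines:  # identical sections produce no diff lines, so skip them
            result[title] = lines
    return result


if __name__ == "__main__":
    doc_a = ("Section 1. Controlling Law\n"
             "This agreement is governed by the laws of Texas.\n"
             "Section 2. IP Indemnification\n"
             "Vendor indemnifies Customer.\n")
    doc_b = doc_a.replace("Texas", "Delaware")
    for title, lines in diff_like_sections(doc_a, doc_b).items():
        print("\n".join(lines))
```

Sections that only exist in one document are silently ignored here; you would probably want to report those separately.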
You can kinda do it with Google’s Custom Search Engine (CSE, now branded Programmable Search Engine), which is basically a thin wrapper around Google. In a regular Google search you can use syntax like -site:ignorethisdomain.com to exclude specific domains (I do this with Pinterest whenever searching for images, for example). But manually typing in a large list of blacklisted domains would be tedious, so instead you can set up a CSE with everybody you want to ignore and then just use the special URL as your search engine.
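If you don't want to set up a CSE, the manual -site: approach can also be scripted. This is just a sketch: the function names and the blocklist are made up, and it only builds the query string and a regular Google search URL, it doesn't fetch anything. Very long blocklists will probably run into query length limits, which is where the CSE route is the better option.

```python
from urllib.parse import urlencode


def build_query(terms, blocked_domains):
    """Append one -site: exclusion per blocked domain to the search terms."""
    exclusions = " ".join(f"-site:{d}" for d in blocked_domains)
    return f"{terms} {exclusions}".strip()


def search_url(terms, blocked_domains):
    """Return a Google search URL with the exclusions baked into the query."""
    query = build_query(terms, blocked_domains)
    return "https://www.google.com/search?" + urlencode({"q": query})


if __name__ == "__main__":
    blocked = ["pinterest.com", "ignorethisdomain.com"]  # hypothetical blocklist
    print(build_query("oak bookshelf", blocked))
    print(search_url("oak bookshelf", blocked))
```

Most browsers let you register a custom search engine from a URL template, so the same exclusions can be pasted in there once instead of running a script each time.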
I worked in a field that managed a lot of technology in retail stores. The big ones know everything about you, it’s just astonishing. At the time (around 15 years ago) there was very little oversight, but also most CIOs were inept and couldn’t really make the data sing and dance. Today that is very much no longer true, and it’s almost too easy to build a comprehensive profile of an “anonymous” guest and then attach it to their personally identifiable information, all without their consent or knowledge.
Not that we have any real info about who collects/uses what when you use the API.