Privacy-oriented Tile Alternative

@WalnutLum@lemmy.ml

For a lot of people for a long time your insurance card (that didn’t have a photo) was the only “identification” you had. Otherwise you had to bring your school ID, work ID etc.

Most people don’t have drivers licenses cause they take the train. When you sign up for banks etc you usually have to get a bunch of official documentation from the local ward office with your information.

Proof of identity in Japan has always been a bit of a hazy problem. You sign most documents still with a family stamp, so the idea of what legally is defined as identifying is kind of vague.

Most local offices aren’t networked up, so when you move you have to register with your local ward office and the japanese beauricratic army goes and gets the previous ward office to fax over your info.

“My Number” is the japanese governments attempt to get all that stuff wired together in one database.

@WalnutLum@lemmy.ml

Yes the “my number card” (national ID) was mostly a volunteer thing but now that it’s needed for health insurance it’s required of everyone

@WalnutLum@lemmy.ml

Japan just hooked up the national health insurance to the national ID.

@WalnutLum@lemmy.ml

I know of matrix, what are some other alternatives?

Also a protocol that got falsely maligned during the crypto days was secure scuttlebutt, and people should be talking about it more.

@WalnutLum@lemmy.ml

I assume a lot of android foss app developers are going to refuse to register and the projects are going to need to be forked.

Personally I’m getting an old feature phone and an ipad mini that only has wifi. If my choice is between apple iOS and google iOS I’d rather just not use anything to do with Google.

@WalnutLum@lemmy.ml

iOS has had this same system for forever and nobody’s ever (seriously) claimed you could sideload on there.

@WalnutLum@lemmy.ml

That’s good and also somewhat disappointing as they were the first to release the weights and mechanism to run them as open weights.

A lot of fully open source (and “ethically trained”, depending on your opinion of that entire idea) models still use major portions of the code they open sourced.

A lot of relatively “good” LLM models run on top of Llama.cpp

@WalnutLum@lemmy.ml

“it says here you clicked ‘sign me up for ISIS’ 10000 times?”

“Haha no officer, you see it was my social chaff AI that clicked it”

@WalnutLum@lemmy.ml

I highly recommend Obtainium to anyone who wants to keep their apps updated without needing a central report (save for the APKs that only publish on f-droid etc)

@WalnutLum@lemmy.ml

You’re going to have to learn python.

Here’s a good overview: https://huggingface.co/docs/transformers/training

@WalnutLum@lemmy.ml

Or open source groups can make a fully open repro of it: https://github.com/huggingface/open-r1

@WalnutLum@lemmy.ml

It’s your queries + your IP combined with the rest of the data the net collects from you that identifies you.

@WalnutLum@lemmy.ml

The fun part is that you don’t have to do all that stuff if you have a long term visa.

@WalnutLum@lemmy.ml

Oh, yes but the DRM exemption clause means that you can backwards engineer the changes and continue releasing them under GPL

Edit: as an example we should probably be looking at the duckststion situation evolving right now:

https://vimuser.org/duckstation.html

@WalnutLum@lemmy.ml

“releasing the modified version to the public” would cover them re-closing the source and then subsequently releasing that newly closed source, so they can’t relicense it and then release the built version of the code.

At least not easily, this is where court history would likely need to be visited because the way it’s worded the interpretability of “modified” in this context would need to be examined.

@WalnutLum@lemmy.ml

One of the few practical things AI might be good at:

https://github.com/CorentinJ/Real-Time-Voice-Cloning

@WalnutLum@lemmy.ml

NewPipe can do peertube as well

@WalnutLum@lemmy.ml

Yes of course, there’s nothing gestalt about model training, fixed inputs result in fixed outputs

@WalnutLum@lemmy.ml

I suppose the importance of the openness of the training data depends on your view of what a model is doing.

If you feel like a model is more like a media file that the model loaders are playing back, where the prompt is more of a type of control over how you access this model then yes I suppose from a trustworthiness aspect there’s not much to the model’s training corpus being open

I see models more in terms of how any other text encoder or serializer would work, if you were, say, manually encoding text. While there is a very low chance of any “malicious code” being executed, the importance is in the fact that you can check the expectations about how your inputs are being encoded against what the provider is telling you.

As an example attack vector, much like with something like a malicious replacement technique for anything, if I were to download a pre-trained model from what I thought was a reputable source, but was man-in-the middled and provided with a maliciously trained model, suddenly the system I was relying on that uses that model is compromised in terms of the expected text output. Obviously that exact problem could be fixed with some has checking but I hope you see that in some cases even that wouldn’t be enough. (Such as malicious “official” providence)

As these models become more prevalent, being able to guarantee integrity will become more and more of an issue.

@WalnutLum@lemmy.ml

I’ve seen this said multiple times, but I’m not sure where the idea that model training is inherently non-deterministic is coming from. I’ve trained a few very tiny models deterministically before…

@WalnutLum@lemmy.ml

I’m not sure where you get that idea. Model training isn’t inherently non-deterministic. Making fully reproducible models is 360ai’s apparent entire modus operandi.

@WalnutLum@lemmy.ml

There are VERY FEW fully open LLMs. Most are the equivalent of source-available in licensing and at best, they’re only partially open source because they provide you with the pretrained model.

To be fully open source they need to publish both the model and the training data. The importance is being “fully reproducible” in order to make the model trustworthy.

In that vein there’s at least one project that’s turning out great so far:

https://www.llm360.ai/

@WalnutLum@lemmy.ml

Holy crap there are still working nitter instances? God bless

@WalnutLum@lemmy.ml

The project was using a way to bypass requiring a backing account to proxy the requests, but the API update broke that

The instances that chose (and choose) to go the extra mile by creating and maintaining proxy account(s) are the ones still working

If the instance gets too popular the twitter goons quickly figure out what the proxy account is and ban it, though. So it’s a constant game of cat and mouse.

@WalnutLum@lemmy.ml

Well, then there’s also this:

Green Pass PDF Wallet https://f-droid.org/packages/com.michaeltroger.gruenerpass/

Extremely simple functionality but it does exactly what’s on the tin

@WalnutLum@lemmy.ml

If you need something to store pkpass files: fWallet https://f-droid.org/packages/business.braid.f_wallet/