Languages are both read and written, restrictions like OP is pining for are fundamentally for reading. As such, it is not terribly helpful that they can opt in to restrictions when writing.
That's the MIX. As mentioned in the README I haven't setup a system for these yet, just made the scaffolding, as a lot of the work in this space is actually around just the MIX layer.
I'll probably be posting about more backend systems once I get started on making them, this project has only been in the works for 5 weeks so far. Though I would invite anyone interested to contribute a MIX module, that's where I have the least expertise.
The demos run as interpreted trees through the x stage, no code emission yet, just direct execution of the AST. That's why they're demos rather than compilers. The scaffolding for native emission is there but empty.
The project is an inference framework which should support 100B parameter model at 5-7tok/s on CPU. No one has quantized a 100B parameter model to 1 trit, but this existing is an incentive for someone to do so.
I had the same question, after some debates with Chatgpt, it's not the "quantize" for post-training we often witness these days, you have to use 1 trit in the beginning since pre-train.
What would be great would be if they tried for a while, and if they didn’t succeed, explicitly report that they failed, and then present their best effort
A pretty poor showing by the villagers. whalers were able to kill a full grown whale in at most a couple of hours, while it was trying to swim away, while balancing in rowboats.
There's a subtext in your point that I want to expand on.
Tech people, particularly engineers, tend to make a fundamental error when dealing with the law that almost always causes them to make wrong conclusions. And that error is that they look for technical compliance when so much of the law is subjective and holistic.
An example I like to use is people who do something illegal on the Internet and then use the argument "you can't prove I did it (with absolute certainty)". It could've been someone who hacked your Wifi. You don't know who on the Wifi did it, etc. But the law will look at the totality of the evidence. Did the activity occur when you were at home and stop when you weren't? How likely are alternative explanations? Etc.
All of that will be considered based on some legal standard depending on the venue. In civil court that tends to be "the preponderance of the evidence" (meaning more likely than not) while in criminal court it's "beyond a reasonable doubt" (which is a much higher standard).
So, using your example, an engineer will often fall into a trap of thinking they can substitute enough words to have a new original work, Ship of Theseus-like. And the law simply doesn't work that way.
So, when this gets to a court (which it will, it's not a question of "if"), the court will consider how necessary the source work was to what you did. If you used it for a direct translation (eg from C++ to Go) then you're going to lose. My prediction is that even using it in training data will be cause for a copyright claim.
If you use Moby Dick in your training data and ask an LLM to write a book like Moby Dick (either explicitly or implicitly) then you're going to have an issue. Even if you split responsibilities so one LLM (training on Moby Dick) comes up with a structure/prompt and another LLM (not trained on Moby Dick) writes it, I don't think that'll really help you avoid the issue.
> So, when this gets to a court (which it will, it's not a question of "if"), the court will consider how necessary the source work was to what you did. If you used it for a direct translation (eg from C++ to Go) then you're going to lose. My prediction is that even using it in training data will be cause for a copyright claim.
This has a lot of similarity to when colorization of film started popping up. Did colorizing black and white movies suddenly change the copyright of the film? At this point is seems mostly the courts say no. But you may find sometimes people rule the other way and say yes. But it takes time and a lot of effort to get what in general people want.
But basically if you start with a 'spec' then make something you probably can get a wholly owned new thing. But if you start with the old thing and just transform it in some way. You can do that. But the original copyright holders still have rights too to the thing you mangled too.
If I remember right they called it 'color of copyright' or something like that.
The LLM bits you are probably right. But that has not been worked out by the law or the courts yet. So the courts may make up new case law around it. Or the lawmakers might get ahead of it and say something (unlikely).
A work could even have two copyrights! Copying a colorized film could require the permission of both the studio that made the film, and the studio that colorized it.
> And that error is that they look for technical compliance when so much of the law is subjective and holistic.
I know it sounds like an oversimplification, but "got off on a technicality" is a common thing among the well-connected and well-heeled. Sure, us nerds probably focus too much on the "technicality" part, since we are by definition technical, but the rest is wishy-washy, unfair BS as far as many of our brains work much of the time.
"Get off on a technicality" is largely police propaganda. The "technicality" tends to be the police violated their rights in some way or did something illegal.
And if you get to trial (without being coerced into a guilty plea for something you may not have done [1]), the cops will lie constantly in police reports and even on the stand. It happens so often it has a name: testalying [2].
The well-connected don't really get off on a "technicality". They simply never get charged in the first place. Only two people were ever charged because of Jeffrey Epstein. One is Epstein, who died, and the other is Ghislane Maxwell who got convicted of trafficking minors to... nobody in particular... and is now in a low-security work camp it's technically illegal for sex offenders to be in.
And even if somehow you, as a connected person, are charged and convicted, well you just buy a pardon [3].
North Dakota jury breaking things to please Daddy T. On a larger scale, jury trials for defamation are a ticking time bomb catastrophe, long term incompatible with free speech. Source: was on a defamation jury and it was an utter clown fiesta and pretty close to ended my remaining faith in the US legal system and the population as a whole.
Jury trials for defamation have been a thing since forever and haven't caused a catastrophe yet. Jury trials occasionally get things wrong as a matter of law, which is why we have an appeals process.
reply