Hm, I thought about this a little and actually came to exactly the opposite conclusion: implement as much as possible in Rust to get the fastest code possible. Do you have any more insight into why this would not be possible / sustainable?
You have two distinct products: 1) an interpreter, 2) a math language.
Don't write your math in some funny imperative computer language.
Keep the interpreter's surface area as small as possible. Do some work to make sure you can accelerate numerics, and JIT/compile functions down to something as close to native as you can.
Both Wolfram and Taliesin Beynon have said that Wolfram was working internally to get a JIT working in the interpreter loop. Keep the core small, and do that now while it's easy.
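To make the "JIT/compile down to native" idea concrete, here's a minimal sketch (illustrative only, not Woxi's actual code) of the lightest version of this: compiling an arithmetic AST once into nested closures, so repeated evaluation skips the tree walk entirely. The `Ast` type and `compile` function are assumptions for the example.

```rust
// Illustrative sketch: compile an arithmetic AST once into nested
// closures so repeated calls avoid re-walking the tree.
enum Ast {
    Num(f64),
    Add(Box<Ast>, Box<Ast>),
    Mul(Box<Ast>, Box<Ast>),
    Var, // the single input variable
}

type Compiled = Box<dyn Fn(f64) -> f64>;

fn compile(ast: &Ast) -> Compiled {
    match ast {
        Ast::Num(n) => {
            let n = *n;
            Box::new(move |_| n)
        }
        Ast::Var => Box::new(|x| x),
        Ast::Add(a, b) => {
            let (fa, fb) = (compile(a), compile(b));
            Box::new(move |x| fa(x) + fb(x))
        }
        Ast::Mul(a, b) => {
            let (fa, fb) = (compile(a), compile(b));
            Box::new(move |x| fa(x) * fb(x))
        }
    }
}

fn main() {
    // 13 * x, compiled once, then callable many times
    let f = compile(&Ast::Mul(Box::new(Ast::Num(13.0)), Box::new(Ast::Var)));
    println!("{}", f(3.0)); // 39
}
```

A real JIT would emit machine code instead of closures, but this shows the shape of the win: pay the compilation cost once, then iterate at near-native speed.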
Also, it's just easier to write in Mathematica. It's probably 10x smaller than the equivalent Rust code:
f[x_Integer]:=13*x;
f::help:="Multiplies x by 13, in case you needed an easy function for that."
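For comparison, here is a hedged sketch of what even that two-line definition might look like on the Rust side of an interpreter. The `Expr` enum and function shape are illustrative assumptions, not Woxi's actual internals:

```rust
// Illustrative sketch only: the Rust-side plumbing needed to express
// f[x_Integer] := 13*x against a minimal expression type.
#[derive(Debug, Clone, PartialEq)]
enum Expr {
    Integer(i64),
    Real(f64),
    Symbol(String),
}

// The pattern `x_Integer` becomes an explicit match arm: only integers
// are rewritten; everything else stays unevaluated, as in Mathematica.
fn f(arg: &Expr) -> Expr {
    match arg {
        Expr::Integer(x) => Expr::Integer(13 * x),
        other => other.clone(), // no matching rule: return unchanged
    }
}

fn main() {
    println!("{:?}", f(&Expr::Integer(3)));         // Integer(39)
    println!("{:?}", f(&Expr::Symbol("y".into()))); // stays symbolic
}
```

Even this toy version is several times longer than the Mathematica one-liner, which is the point being made above.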
EDIT: Another important thing to note is that the people who really deeply know specific subjects in math won't be the best, or even good, Rust programmers. So letting them program in woxilang will give them an opportunity to contribute which they wouldn't otherwise have had.
I'm not a PL expert but isn't building a decent JIT a massive undertaking? I guess you're saying that the JIT itself would be what makes a project like this worth using in the first place?
It's like most things in software: if you constrain the problem enough, focus on the problems you actually have, and make some smart choices early on, it can be a very modest lift, on the order of a week or two for a 90% solution. At the other end of the spectrum, it's a lifetime of work for a team of hundreds...
Most humans like external validation. Having other people use what they've built is a common metric of validation in open source software. Economically, these metrics can be used as credentials for things like job applications or consideration in joining groups.
Such a massive undertaking would be almost impossible without AI agents, so yeah, they help me. But with around 5000 tests, they are actually helping to improve the software quality!
Reviewing the correctness of code is a lot harder than writing correct code, in my experience. Especially when the code given looks correct on an initial glance, and leads you into faulty assumptions you would not have made otherwise.
I'm not claiming AI-written and human-reviewed code is necessarily bad, just that the claim that reviewing code is equivalent to writing it yourself does not match my experience at all.
Plus, if you look at the commit cadence, there are a lot of commits 5-10 minutes apart in places that add new functionality (which, I realize, doesn't mean they were "written" in that time).
I find people do argue a lot that "if it is reviewed, it is the same", which might hold when you start, but the allure of just glancing at the code, going "it makes sense", and hammering on is very strong and hard to resist.
We are still early into the use of these tools so perhaps best practices will need to be adjusted with these tools in mind. At the moment it seems to be a bit of a crap shoot to me.
You wrote: “If they hand wrote the code we know they at least looked at it once.”
But that’s not true, because even if they did indeed hand-write the code, we as third parties wouldn’t have any reliable proof of that, and therefore still couldn’t draw the conclusion that they looked at the code.
Them claiming to have hand-written the code isn’t any better than them claiming to have thoroughly reviewed the code. We can’t know in either case.
> Them claiming to have hand-written the code isn’t any better than them claiming to have thoroughly reviewed the code. We can’t know in either case.
It is better, because most people aren't outright liars. If he said "I hand-wrote the code", sure, it doesn't prove it, but I would believe him. When he says "I thoroughly reviewed the code"... yeah, maybe. That's the sort of thing people do lie about (even to themselves).
Sure, but you've got to start somewhere! And with the amount of progress I was able to make in just a few weeks, I'm very optimistic that the polish will come sooner rather than later.
Based on the list of contributors to your project, I am not sure this starting location is optimally suited to the task of building a foundation for polished, reliable, expandable software.
I have no dog in this fight, but simply claiming a count of tests gets you anything is like saying your code coverage is 100%: it sounds really good until you think about what 5000 unreviewed tests actually... do.
If I go by the contributor numbers on Github, I see Claude has committed something on the order of 300,000 lines of code. I don't think it's reasonable to review that much code, even in weeks worth of time.
I haven’t needed to do such a thing in a while, but my “rule” for explaining how unreasonable it is goes: “if I only glanced at each line of code for one second, without bothering to understand the details, it would take me three and a half full 24-hour days, non-stop, just to look at it all.” So it’s definitely more than one work week, because presumably other stuff needs to get done in that time too. Actually understanding it is going to take at least a multiple of that, and the multiple is probably ~30x.
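The arithmetic behind that rule of thumb checks out; a quick back-of-the-envelope calculation (the 1 second/line and ~30x figures are the commenter's estimates):

```rust
// Back-of-the-envelope check: how long does it take to merely glance
// at 300,000 lines of code at one second per line?
fn main() {
    let lines = 300_000_f64;
    let secs_per_line = 1.0;
    let days = lines * secs_per_line / (60.0 * 60.0 * 24.0);
    println!("{:.2} days just glancing", days);            // ~3.47 days
    println!("{:.0} days at ~30x to understand", days * 30.0); // ~104 days
}
```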
It's a defense mechanism. I was guilty as charged as well, initially. Suddenly most of your l33t skillz are trivialized and surpassed by an inhuman actor. It's a tough pill to swallow.
I'm curious whether you intend to reimplement the highly optimized numerical algorithms, symbolic algorithms, and so on, accumulated and tuned in Mathematica since its 1988 release?
There's a huge amount of technology in Mathematica's standard library, beyond the surface syntax and rewrite system, I mean.
Hi, I'm the main developer. We're steadily getting closer to the next release which will support most features of Mathematica 1.0 plus some of the most popular newer functions (> 900 overall!). AMA!
It's a worthwhile effort. If successful, Woxi can enable a large mass of scientists and engineers who don't have access to Mathematica to run legacy code written for it. Also, Woxi would give those scientists and engineers who regularly use Mathematica a non-proprietary, less restrictive alternative, which many of them would welcome.
How does Woxi compare to other "clean-room implementations"[a] of the same language?
--
[a] Please check with a lawyer to make sure you won't run into legal or copyright issues.
There's a mystique around Mathematica's math engine. Is this groundless, or will you eventually run into problems getting correct, identical answers, especially for answers that Mathematica derives symbolically? The capabilities and results of the computer algebra systems that I've used varied widely.
Hard to tell, honestly. So far, whenever I had a problem with the math engine, there was always some surprisingly straightforward solution. There is actually a lot of public research on how equations can be solved/simplified with computer algorithms. So I'm optimistic.
I also stumbled upon a few cases where Mathematica itself didn't quite do things correctly (rounding errors, missing simplifications, etc.). So maybe it's actually a little overhyped …
Scmutils from MIT does a very good, arguably better, job for correctness. No symbolic integration, by ideology, and not identical. Sussman and Terman. Amazing attention to detail and correctness. Claude could probably bridge Scheme to Wolfram.
I'm not sure how important bug-for-bug identical output really is.
This really doesn't bode well... I'm no expert in CASes, but everything I've looked at seems very naive. Vibe coding a Mathematica replacement makes zero sense to me.
Interesting, thanks for sharing. Naive question, as I'm not very familiar with Mathematica (though aware of it, Wolfram Alpha, and related tools): how does it compare to e.g. Jupyter or Julia, or maybe another language (with its framework) that might be even closer?
I think Wolfram Language is just so much more ergonomic. No need to import dependencies - everything's included and consistent - very readable yet compact syntax, fewer gotchas than Python, R, etc., sensible defaults, …
YMMV, but I've found that you sure do need to import things eventually, and it's not so ergonomic because most projects just end up as mega-notebooks.
Just like Python or any other language that looks easy for the learning examples, there are still hairy bits, they're just better hidden. The difference is that the debuggers for Python are far better.
Mathematica is great for quick stuff, but once you hit a particular level of complexity it goes crazy. In this regard I find it similar to Bash.
The license is a big deal, and not just for cost and openness, but also for practical use in places like Docker, CI/CD pipelines, cloud deployments, or other contexts where licenses need to be dynamic.
Yeah, I've already looked into it, but decided to keep developing it "example driven" for now. That is, I'm playing around with it, and whenever I find something that's broken I keep a note of it; then I pick those notes one by one and implement them. Once the most common things are implemented, I will start writing property tests to catch all the edge cases of each feature.
I'm saying you can go even further and automate the entire thing using LLMs/agents, it is pretty much the ideal use case: you have a black-box reference implementation to test against; descriptive documentation for what the functions should do; some explicitly supplied examples in the documentation; and the ability to automatically create an arbitrary number of tests.
So not only do you have a closed loop system that has objective/automatic pass-fail criteria you also don't even have to supply the instructions about what the function is supposed to do or the test cases!
Obviously this isn't going to be 100% reliable (especially for edge cases) but you should be able to get an enormous speed up. And in many cases you should be able to supply the edge case tests and have the LLM fix it.
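The closed loop described above can be sketched very simply. Here the reference implementation and the candidate are modeled as plain functions; in practice they would be subprocess calls to the real interpreters (an assumption of this example), and the mismatch list would be fed back to the agent:

```rust
// Illustrative differential-testing sketch: run generated inputs through
// both evaluators and collect (input, expected, actual) mismatches.
fn differential_test<R, C>(
    reference: R,
    candidate: C,
    inputs: &[i64],
) -> Vec<(i64, i64, i64)>
where
    R: Fn(i64) -> i64,
    C: Fn(i64) -> i64,
{
    inputs
        .iter()
        .filter_map(|&x| {
            let (want, got) = (reference(x), candidate(x));
            (want != got).then_some((x, want, got))
        })
        .collect()
}

fn main() {
    // Stand-ins: pretend these shell out to the two interpreters.
    let reference = |x: i64| 13 * x;       // "ground truth"
    let candidate = |x: i64| 13 * x.abs(); // buggy for negative inputs
    let failures = differential_test(reference, candidate, &[-2, -1, 0, 1, 2]);
    println!("{failures:?}"); // mismatches at -2 and -1
}
```

The point is that the pass/fail criterion is fully mechanical, so an agent can iterate on the candidate until the mismatch list is empty.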
(Codex is still free for the next few days if you want to try their "High"/"Extra high" thinking models)
You accidentally raise an interesting point: good, thorough public documentation, once considered a great selling point for your system, now invites automated reimplementation by competition. It would be a shame to see public docs vanish because it turns out they are literally machine readable specs.
I also think that sandboxing is crucial. That’s why I’m working on a Wolfram Language interpreter that can be run fully sandboxed via WebAssembly: https://github.com/ad-si/Woxi
Awesome. I'm pretty unfamiliar with the Wolfram Language, but my understanding is that its power came from the fact that it was very batteries-included in terms of standard library and even data connections (like historical weather or stock market data).
What exactly does Woxi implement? Is it an open source implementation of the core language? Do you have to bring your own standard library or can you use the proprietary one? How do data connections fit into the sandboxing?
I realise I may be uninformed enough here that some of these might not make sense though, interested to learn.
Yes, we agree that a lot of the value comes from the huge standard library. That's why we try to implement as much of it as possible; right now we support more than 900 functions. All the Data functions will be a little more complicated, of course, but they could, e.g., make requests to online data archives (ourworldindata.org, wikidata.org, …). So I think it's definitely doable.
We also want to provide an option for users to add their own functions to the standard library. So if they e.g. need `FinancialData[]` they could implement it themselves and provide it as a standard library function.
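One plausible shape for such user-supplied builtins is a name-to-closure registry; this is a hedged sketch with hypothetical names (`Registry`, `register`, `call`), not Woxi's actual API, and it uses `f64` arguments where a real interpreter would use its expression type:

```rust
use std::collections::HashMap;

// Illustrative sketch: a registry mapping function names to Rust
// closures, so users could plug in their own FinancialData-style
// builtins alongside the built-in standard library.
type Builtin = Box<dyn Fn(&[f64]) -> f64>;

struct Registry {
    fns: HashMap<String, Builtin>,
}

impl Registry {
    fn new() -> Self {
        Registry { fns: HashMap::new() }
    }

    fn register(&mut self, name: &str, f: Builtin) {
        self.fns.insert(name.to_string(), f);
    }

    // Returns None for unknown names, so the interpreter could leave
    // the expression unevaluated, as Mathematica does.
    fn call(&self, name: &str, args: &[f64]) -> Option<f64> {
        self.fns.get(name).map(|f| f(args))
    }
}

fn main() {
    let mut reg = Registry::new();
    // A user-supplied stand-in for a missing builtin:
    reg.register(
        "Mean",
        Box::new(|xs| xs.iter().sum::<f64>() / xs.len() as f64),
    );
    println!("{:?}", reg.call("Mean", &[1.0, 2.0, 3.0])); // Some(2.0)
}
```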