[D] Monthly Who's Hiring and Who wants to be Hired?
For Job Postings please use this template: Hiring: [Location], Salary: [], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you're looking for]. For Those looking for jobs...
[D] Self-Promotion Thread
Please post your personal projects, startups, product placements, collaboration needs, blogs etc. Please mention the payment and pricing requirements for products and services. Please do not post link...
[P] Micro Diffusion — Discrete text diffusion in ~150 lines of pure Python
Inspired by Karpathy's MicroGPT, I wanted to build the equivalent for text diffusion — a minimal implementation that shows the core algorithm without the complexity. Autoregressive models generate left...
[D] Industry expectations in Machine Learning Engineers in 2026
submitted by /u/AccomplishedCat4770
[D] AI/ML PhD Committee
Hey all — quick question for senior PhD folks. I’m finalizing my Plan of Study and trying to decide on my committee composition. There’s a professor in our department whose work is aligned with mine and...
[D] Works on flow matching where source distribution comes from dataset...
Flow matching is often discussed in the context of image generation from Gaussian noise. In principle, we could model the flow from a complicated image distribution into another complicated image...
[D] ICLR Workshop Results
The ICLR 26 websites mention that the mandatory notification for workshop paper accept/reject is 28 Feb 2026 (AoE). So has anyone received their decisions yet? submitted by /u/Klutzy-Childhood-126...
[R] Tiny transformers (
Really interesting project. Crazy you can get such good performance. A key component is that they are digit tokens. Floating math will be way trickier. submitted by /u/LetsTacoooo
[R] CVPR'26 SPAR-3D Workshop Call For Papers
If you are working on 3D vision models, please consider submitting your work to the SPAR-3D workshop at CVPR! :) The submission deadline has been extended to March 21, 2026. Workshop website:...
[P] Building A Tensor micrograd
Hi! We're all aware of Andrej Karpathy's micrograd package and his amazing lecture on it. When I saw it a while ago, I was curious how one can develop it into a more standard vectorized package rather...
[R] Detecting invariant manifolds in ReLU-based RNNs
In a new #ICLR2026 publication we provide a novel algorithm for semi-analytically constructing the stable and unstable manifolds of fixed points and cycles of ReLU-based...
[P] R2IR & R2ID: Resolution Invariant Image Resampler and Diffuser - Trained...
This is a continuation of my ongoing project. The previous posts can be found here and here; formerly known as S2ID and SIID before that. Since then, a lot has changed, and R2IR and R2ID work very...
[D] Simple Questions Thread
Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead! Thread will stay alive until next one so keep posting after...
[P] easy-torch-tpu: Making it easy to train PyTorch-based models on Google TPUs
I've been working with Google TPU clusters for a few months now, and using PyTorch/XLA to train PyTorch-based models on them has frankly been a pain in the neck. To make it easier for everyone else,...
[R] CVPR 2026 Camera Ready Paper
Hi everyone, This is the first time I had an experience with a top machine learning conference. My paper was accepted for CVPR findings, I wanted to know what is the process of submitting the final...
[R] Benchmarked 94 LLM endpoints for jan 2026. open source is now within 5...
been doing a deep dive on model selection for production inference and pulled together some numbers from whatllm.org's january 2026 report... thought it was worth sharing because the trajectory is...
[R] Toward Guarantees for Clinical Reasoning in Vision Language Models via...
AI (VLM-based) radiology models can sound confident and still be wrong, hallucinating diagnoses that their own findings don't support. This is a silent and dangerous failure mode. Our new paper...
[D] Geospatial ML for humanitarian drought/flood forecasting: critique my...
I'm working on a non-commercial geospatial ML project (AidMap AI) focused on Central Asia/Afghanistan/Syria – predicting "urgency levels" for slow-onset ecological crises (droughts, floods, crop...
[D] ICLR 2026 Registration Process
Hello, I apologize if this is not the correct place to ask this but I couldn't find any subs related to this. I am a first time author and our paper got accepted to ICLR 2026. I was trying to register for...
[D] How to get credits to run experiments on closed source models as a...
Hello! I am working on building and evaluating frontier models on a benchmark. The task is overall pretty reasoning intensive, and ends up consuming a lot of tokens. For reference, in our pilot tests,...
[D] The engineering overhead of Verifiable ML: Why GKR + Hyrax for on-device...
The idea of "Privacy-Preserving AI" usually stops at local inference. You run a model on a phone, and the data stays there. But things get complicated when you need to prove to a third party that the...
[R] TorchLean: Formalizing Neural Networks in Lean
arXiv:2602.22631 [cs.MS]: https://arxiv.org/abs/2602.22631 Robert Joseph George, Jennifer Cruden, Xiangru Zhong, Huan Zhang, Anima Anandkumar. Abstract: Neural networks are increasingly deployed in...
[P] Vera: a programming language designed for LLMs to write
I've built a programming language whose intended users are language models, not people. The compiler works end-to-end and it's MIT-licensed. Models have become dramatically better at programming over...
[R] Are neurons the wrong primitive for modeling decision systems?
A recent ICLR paper proposes Behavior Learning — replacing neural layers with learnable constrained optimization blocks. It models it as: "utility + constraints → optimal...
[D] How much time do you actually lose trying to reproduce ML papers?
Hey folks! Long-time lurker, first time poster. I’m a PhD student, and I’ve been wondering: how much time do you actually spend just trying to reproduce ML papers? Even when the code is available, it...