[D] Monthly Who's Hiring and Who wants to be Hired?
For Job Postings please use this templateHiring: [Location], Salary:[], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you're looking for]For Those looking for jobs...
View Article[D] Self-Promotion Thread
Please post your personal projects, startups, product placements, collaboration needs, blogs etc.Please mention the payment and pricing requirements for products and services.Please do not post link...
View ArticleIncoming ICML results [D]
First time submitted to ICML this year and got 2,3,4 and I have so much questions:Do you think this is a good score? Is 2 considered the baseline? Is this the first time they implemented a 1-5 score...
View Article[P] Fire detection drone
I’ve been given this project where I have to put a camera on a drone and somehow make it detect fires. The thing is, I have no idea how to approach the AI part. I’ve never done anything with computer...
View ArticleWhisper Translation Finetuning [P]
I am trying to finetune whisper for live translation. My input will be audio from lang-A and the output will be in English text. I created a dataset using indicTrans2 and google fleurs. It adds a...
View Article[D] Consistently Low Accuracy Despite Preprocessing — What Am I Missing?
Hey guys,This is the third time I’ve had to work with a dataset like this, and I’m hitting a wall again. I'm getting a consistent 70% accuracy no matter what model I use. It feels like the problem is...
View Article[D] WGAN-GP loss stuck and not converging.
I implemented a wgan-gp from scratch in pytorch and the loss is not convering. The generator loss rises to 120 and the critic loss drops to -100 and both stops there and the images generated are some...
View Article[D] Eyebrow Simulation using AR and Facial Recognition
Good Day everyone! I am a 3rd year student from PH. This semester were conducting our capstone. We're building a web based app for a salon business that especialize on eyebrows. Our web has a feature...
View ArticleLearnable matrices in sequence without nonlinearity - reasons? [R]
Sometimes in ML papers I see architectures being proposed which have matrix multiplications in sequence that could be collapsed into a single matrix. E.g. when a feature vector x is first multiplied by...
View ArticleHow to handle imbalanced output scales in PINN/PI-DeepONet loss function? [R]
Hi everyone, I’m working on PINNs and PI-DeepONet with multiple outputs, and my loss function only includes residuals. No data loss. The issue is that one of the outputs is much smaller in magnitude...
View Article[D] Simple Questions Thread
Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead!Thread will stay alive until next one so keep posting after...
View ArticleSEFA: A Self-Calibrating Framework for Detecting Structure in Complex Data...
I've developed Symbolic Emergence Field Analysis (SEFA), a computational framework that bridges signal processing with information theory to identify emergent patterns in complex data. I'm sharing it...
View Article[P] Looking for ModaNet dataset
Long time lurker, first time poster. Please let me know if this kind of question isn't allowed!Has anybody used ModaNet recently with a stable download link/mirror? I'd like to benchmark against...
View Article[D] Are weight offloading / weight streaming approaches like in Deepseek Zero...
As someone from a developing nation which simply cannot afford to keep up GPU purchases with LLM scaling trends, I'm invested in the question of LLM inference in disproportionately low-VRAM...
View Article[D] ICML 2025 Results Will Be Out Today!
ICML 2025 decisions will go live today. Good luck, everyone. Let's hope for the best! 🤞 https://icml.cc/ submitted by /u/darkknight-6 [link] [comments]
View Article[R] Reinforcement Learning for Reasoning in Large Language Models with One...
https://preview.redd.it/7ftw52jynaye1.png?width=1230&format=png&auto=webp&s=92b838b886206d020d7d43c536f237c9dfd89d2dtitle speaks for itself submitted by /u/Classic_Eggplant8827 [link]...
View Article[R] Meta releases synthetic data kit!!
Synthetic Data Kit is a CLI tool that streamlines the often overlooked data preparation stage of LLM fine-tuning. While plenty of tools exist for the actual fine-tuning process, this kit focuses on...
View Article[D] Best Free AI Tools of 2025
I've been exploring a bunch of AI tools this year and figured I’d share a few that are genuinely useful and free to try. These cover a range of use cases—writing, voice generation, profile photos, and...
View ArticleCurrent data controls against a synthetic flood [D]
Considering a significant potential risk for AI and the internet: the 'Infected Corpus', a scenario where generative AI is used to flood the internet with vast amounts of plausible fake content,...
View Article[D] The leaderboard illusion paper is misleading and there are a lot of bad...
Recently this paper came out with the title "The Leaderboard Illusion". The paper critiques the lmsys leaderboard. While the contents of the paper appear to be solid and reasonable critiques, the title...
View Article[P] OpenAI-Evolutionary Strategies on Lunar Lander
I recently implemented OpenAI-Evolutionary Strategies algorithm to train a neural network to solve the Lunar Lander task from Gymnasium.https://youtu.be/FSIsw583hcc?feature=shared submitted by...
View Article[D] Submitting applied ML papers to NeurIPS
I have a project and corresponding research paper ready that I have been working on for a while, and I just got finished now a few weeks before the NeurIPS deadline. My paper is definitely on the more...
View Article[P] - Deep reinforcement Learning with Unreal Engine
Hey everyone! I recently created UnrealMLAgents — a plugin that brings the core features of Unity ML-Agents into Unreal Engine.Unreal Engine is a high-fidelity game engine great for simulations, while...
View Article[D] Don't remember the name of ML paper about how research done, maybe you...
Hi, I remember once I stumbled upon second meaning of SGD acronym, about professor sending their graduate students to keep trying everything till get something, and once they get better result - try to...
View Article[D] Papers/ tips for creating an activation-atlas like this google/open-ai one?
I want to create an activation atlas like the one made by Google and OpenAI in 2019 (https://distill.pub/2019/activation-atlas/ ). However the "lucid" package they used is not up-to-date.I've found...
View Article[R] Leaderboard Hacking
In this paper, “Leaderboard Illusion”, Cohere + researchers from top schools show that Chatbot Arena rankings are rigged - labs test privately and cherry-pick results before public release, exposing...
View Article