[D] Monthly Who's Hiring and Who wants to be Hired?
For Job Postings please use this templateHiring: [Location], Salary:[], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you're looking for]For Those looking for jobs...
View Article[D] Self-Promotion Thread
Please post your personal projects, startups, product placements, collaboration needs, blogs etc.Please mention the payment and pricing requirements for products and services.Please do not post link...
View Article[D] Relevance of Minimum Description Length to understanding how Deep...
There's a subfield of statistics called Minimum Description Length. Do you think it has a relevance to understanding not very well explained phenomena of why deep learning works, i.e. why...
View Article[P][Q] Help with multilabel classification
Hey guys, so I’m a noob in ML (started learning a month ago.) I’m pretty new to this so correct me if I’m understanding things wrong.Im trying to find out the feature importances in a particular...
View Article[R] Patronus AI, Columbia University and Meta release BLUR benchmark for...
Hugging Face dataset: https://huggingface.co/datasets/PatronusAI/BLUR submitted by /u/Megixist [link] [comments]
View Article[D] Are you happy with the ICML discussion period?
Are you happy with the ICML discussion period?My reviewers just mentioned that they have acknowledged my rebuttals.I'm not sure the "Rebuttal Acknowledgement" button really helped get the reviewers...
View Article[R] Implemented 18 RL Algorithms in a Simpler Way
I decided to create a comprehensive learning project in a Jupyter Notebook to implement RL Algorithms such as PPO, SAC, A3C and more. (Theory + Code).Code, documentation, and example can all be found...
View Article[R] Neuron-based explanations of neural networks sacrifice completeness and...
TL;DR: The most important principal components provide more complete and interpretable explanations than the most important neurons.This work has a fun interactive online demo to play around...
View Article[D] Patience vs batch size
I've written a classification project built on ResNet where I adapt my learning rate, unfreezing layers and EarlyStopping based on a patience variable. How should this patience variable be adapted...
View Article[D] Interpreting Image Patch and Subpatch Tokens for Latent Diffusion
I'm not very familiar with works interpreting patch tokens or representations, aside from [1], a recent work describing how Vision Transformers for Classification improve as patches decrease in size (+...
View Article[D][P][R]Best techniques for Fine-Tuning Embedding Models ?
What are the current SOTA techniques to fine-tune embedding models ? submitted by /u/DedeU10 [link] [comments]
View Article[D] Give me a critique for my book
Hello everyone,A bit of background about myself: I'm an upper-secondary school student who practices and learns AI concepts during their spare time. I also take it very seriously.Since a year ago, I...
View Article[R] Deploy your own AI Operator on macOS
A step-by-step guide to pairing OpenAI's computer-use-preview model with a macOS VM sandbox. Why build your own instead of using ChatGPT's Operator? - Control native macOS apps, not just web - Better...
View Article[D] Time series models with custom loss
Suppose I have a time-series prediction problem, where the loss between the model's prediction and the true outcome is some custom loss function l(x, y).Is there some theory of how the standard ARMA /...
View Article[P] Looking for resources on simulating social phenomena with LLM
I want to simulate social phenomena using LLM agents. However, since my major is in computer science, I have no background in social sciences. Are there any recommended resources or researchers working...
View Article[D] Fine-tuning a fine-tuned YOLO model?
I have a semi annotated dataset(<1500 images), which I annotated using some automation. I also have a small fully annotated dataset(100-200 images derived from semi annotated dataset after I...
View Article[D] Anyone got reviews for the paper submitted to AIED 2025 conference
Anyone got reviews for the paper submitted to AIED 2025 conference? I am yet to receive mine while few others have already got it. Have mailed chairs but doubt if I will get any reply. Anyone connected...
View Article[P] Privately Hosted LLM (HIPAA Compliant)
Hey everyone, I need to parse text prompts from users and map them to a defined list of categories. We don't want to use a public API for data privacy reasons as well as having more control over the...
View Article[R] Speech to text summarisation - optimised model ideas
Hi, I'm a cs major who choose speech to text summarisation as my honors topic because I wanted to pick something from machine learning field so that I could improve my understanding.The primary goal is...
View Article[R] For those of you who are familiar with Kolmogorov Arnold Networks and the...
As the title suggests, I wanted to know if a B-Spline for a given grid can be represented using a Meijer-G function? Or is there any way by which the exact parameters for the Meijer-G function can be...
View Article[R]Struggling to Pick the Right XAI Method for CNN in Medical Imaging
Hey everyone! I’m working on my thesis about using Explainable AI (XAI) for pneumonia detection with CNNs. The goal is to make model predictions more transparent and trustworthy—especially for...
View Article[R] measuring machine translation quality
I want to translate some 100k English sentences into another language. How can I measure the translation quality? Any ideas? submitted by /u/Powerful-Angel-301 [link] [comments]
View Article[D] UAI 2025 Reviews Waiting Place
A place to share your thoughts, prayers, and, most importantly (once the reviews are out, should be soon...), rants or maybe even some relieved comments. Good luck everyone! submitted by...
View Article[R] Multi-Token Attention: Enhancing Transformer Context Integration Through...
Multi-Token AttentionI was reading about a new technique called Multi-Token Attention that improves transformer models by allowing them to process multiple tokens together rather than looking at each...
View Article[R] Position: Model Collapse Does Not Mean What You Think
The proliferation of AI-generated content online has fueled concerns over model collapse, a degradation in future generative models' performance when trained on synthetic data generated by earlier...
View ArticleAI tools for ML Research - what am I missing? [D]
AI/ML Researchers who still code experiments and write papers. What tools have you started using in day-to-day workflow? I think it is way different what other SWE/MLE uses for their work.What I use...
View Article[N] Open-data reasoning model, trained on curated supervised fine-tuning...
Open Thoughts initiative was announced in late January with the goal of surpassing DeepSeek’s 32B model and releasing the associated training data, (something DeepSeek had not done). Previously, team...
View Article