[D] Monthly Who's Hiring and Who wants to be Hired?
For job postings, please use this template: Hiring: [Location], Salary: [], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you're looking for]. For those looking for jobs...
[D] Self-Promotion Thread
Please post your personal projects, startups, product placements, collaboration needs, blogs etc. Please mention the payment and pricing requirements for products and services. Please do not post link...
[R] Mitigating Real-World Distribution Shifts in the Fourier Domain (TMLR)
TLDR: Do unsupervised domain adaptation by simply matching the frequency statistics of train and test domain samples, no labels needed. Works for vision, audio, time-series. paper (with code):...
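To make "matching the frequency statistics" concrete, here is a minimal sketch of one way it could look for 1-D signals. It is not the paper's method: the function name `match_amplitude_statistics`, the choice of the mean amplitude spectrum as the statistic, and the synthetic data are all assumptions made for illustration. The idea is to rescale each frequency bin of the test-domain samples so that their mean amplitude matches the train domain, while leaving phase untouched.

```python
import numpy as np

def match_amplitude_statistics(source, target, eps=1e-8):
    """Rescale target signals so their mean amplitude spectrum matches source.

    source, target: real arrays of shape (n_samples, length).
    Hypothetical helper for illustration; not taken from the paper.
    """
    # Mean amplitude per frequency bin in the source (train) domain.
    src_amp = np.abs(np.fft.rfft(source, axis=1)).mean(axis=0)
    # Spectrum and mean amplitude per bin in the target (test) domain.
    tgt_fft = np.fft.rfft(target, axis=1)
    tgt_amp = np.abs(tgt_fft).mean(axis=0)
    # A real, per-bin scale changes amplitudes but keeps each sample's phase.
    scale = src_amp / (tgt_amp + eps)
    return np.fft.irfft(tgt_fft * scale, n=target.shape[1], axis=1)

# Synthetic example: the target domain is a louder version of the same noise.
rng = np.random.default_rng(0)
source = rng.normal(size=(32, 64))
target = 3.0 * rng.normal(size=(32, 64))
adapted = match_amplitude_statistics(source, target)
```

No labels are used anywhere, which is the property the TLDR emphasises; only unlabeled samples from both domains are needed to estimate the statistics.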
[R] How Do Large Language Monkeys Get Their Power (Laws)?
submitted by /u/RSchaeffer
[D] ICASSP 2025
Hi there, I will be attending ICASSP this year. I was wondering if there are folks from the community attending the conference as well. Perhaps we can catch up sometime. PS: Has already reached the venue...
[R] Improving Generalist Reward Models with Self-Principled Critique Tuning...
DeepSeek's new reward modeling approach uses inference-time scaling to significantly outperform existing systems. Their DeepSeek Generalist Reward Model (GRM) introduces Self-Principled Critique...
[Discussion] This might be a really dumb question regarding current training...
So why can't we train a very large network at low quantization, get the lowest test error possible, prune the network at the lowest test error epoch, and then increase the quantization or the remaining...
[R] Novel Logic-Enhanced LLM for Improved Symbolic Reasoning
I’m experimenting with a novel approach that integrates symbolic logic directly into a transformer’s attention mechanism. By using a custom spaCy-based logic parser, I generate a “logic mask” that...
KDD 2025 [Cycle 2] Reviews Are Out!
Hi everyone, KDD 2025 paper reviews are visible on OpenReview. With the reviews released, I thought I would create a discussion thread to gather thoughts, questions and recommendations or anything else....
[D] Are Domain Adversarial Neural Networks (DANN) used in real world...
I find the idea presented in that paper very attractive, being able to train on one controlled domain, for which it is easy to label data, and "transfer" it to another domain which can be quite hard to...
[P] anyone working on Arabic OCR?
all the OCRs i tried for Arabic don’t work well at all. i’m really interested in working on building a proper Arabic OCR. if you know anyone working on it or any open projects, please let me know. i’d...
[D] Has anyone else observed structured, persistent linguistic emergence in...
This is but one small piece of a large amount of phrases I have been working with in an LLM. This arose without any attempt on my part to get the system to speak in another language. It arose...
[D] Rich Sutton: Self-Verification, The Key to AI
submitted by /u/jsonathan
[D] ICML 2025 - what if reviewers don't acknowledge rebuttal?
2 out of my 5 reviewers at ICML didn't acknowledge my rebuttal at all. Not only did they not answer, they didn't even click the "acknowledge rebuttal" button. According to ICML rules, they are required to...
[R] NoProp: Training neural networks without back-propagation or...
https://arxiv.org/pdf/2503.24322 Abstract: The canonical deep learning approach for learning requires computing a gradient term at each layer by back-propagating the error signal from the output towards...
[N] Llama 4 release
Llama4 ELO score vs cost. https://www.llama.com/ submitted by /u/we_are_mammals
[P] A tool to create a ranked list of projects in ML/AI for CS students
TL;DR: This is still a work in progress, but I want to hear your early feedback! Inspired by a recent post by Neel Nanda on Research Directions in explainable AI, I'm building a tool that extracts projects...
[P] Sales forecasting based on historic sales, need some help. Starter in ML...
Hi, guys. How are you? First post here. I am working on a sales forecasting problem. I have 2017-2019 data, it has per day sales of different products and if they were on discount or not, unit retail...
[R] [D] harmonic clustering a new approach to uncover music listener groups....
i recently completed a project called harmonic clustering where we use network science and community detection to uncover natural music listener groups from large scale streaming data. the twist is we...
[N] CfP MIDAS workshop @ECML-PKDD 2025 - 10th Workshop on MIning DAta for...
MIDAS 2025: The 10th Workshop on MIning DAta for financial applicationS, September 15 or 19, 2025 - Porto, Portugal...
[D] How to handle limited space in RAM when training in Google Colab?
Hello, I am currently trying to solve the IEEE-CIS Fraud Detection competition on Kaggle and I have made myself a Google Colab notebook where I am working with the data. The issue I have is that...
[D] Scanning the OpenAI cookbook for vulnerabilities (with open-source)
submitted by /u/imalikshake
[D] IJCAI 2025 reviews and rebuttal discussion
Thread for discussion submitted by /u/BigJuggernaut7380
[D] Everyday examples of non-linearly separable problems
I'm trying to think of examples that help to intuitively understand the concept of non-linearly separable problems. For example, determining if two inputs are equal is one such problem, but I'm hoping...
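The equality example mentioned in this post can be checked with a few lines of code. The sketch below is an illustration only: the helper name `train_perceptron` and the epoch budget are assumptions, not anything from the post. It trains a single linear threshold unit on two tasks over binary inputs: AND, which is linearly separable, and "are the two inputs equal?", which is not.

```python
def train_perceptron(samples, epochs=100):
    """Return True if a single linear unit classifies every sample correctly.

    samples: list of ((x1, x2), label) pairs with labels 0 or 1.
    Hypothetical helper for illustration only.
    """
    w1 = w2 = b = 0.0
    for _ in range(epochs):
        errors = 0
        for (x1, x2), label in samples:
            pred = 1 if w1 * x1 + w2 * x2 + b > 0 else 0
            update = label - pred  # -1, 0 or +1
            if update != 0:
                # Classic perceptron rule: nudge the line toward the mistake.
                w1 += update * x1
                w2 += update * x2
                b += update
                errors += 1
        if errors == 0:
            return True  # an error-free pass means a separating line exists
    return False

inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]
and_data = [((a, b), int(a and b)) for a, b in inputs]
equal_data = [((a, b), int(a == b)) for a, b in inputs]

and_separable = train_perceptron(and_data)      # finds a separating line
equal_separable = train_perceptron(equal_data)  # never has an error-free pass
```

An error-free pass is only possible when some straight line splits the two classes, so the equality task can never succeed no matter how large `epochs` is: the points (0, 0) and (1, 1) sit on one diagonal and (0, 1) and (1, 0) on the other.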
[R] Image classification by evolving bytecode
Over the last few years, I’ve been working on Zyme, an esoteric language for genetic programming: creating computer programs by means of natural selection. I’ve started seeing promising results,...
[R] SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
submitted by /u/AhmedMostafa16
[R] Uniformly distributed deep feature representations improve fairness &...
TLDR: Theoretically and empirically demonstrates that encouraging deep feature representations to be uniformly distributed improves fairness and robustness (specifically, sub-group robustness and...