[D] Monthly Who's Hiring and Who Wants to Be Hired?
For job postings, please use this template: Hiring: [Location], Salary: [], [Remote | Relocation], [Full Time | Contract | Part Time], and [Brief overview, what you're looking for]. For those looking for jobs...
[D] Self-Promotion Thread
Please post your personal projects, startups, product placements, collaboration needs, blogs, etc. Please mention the payment and pricing requirements for products and services. Please do not post link...
[D] Dynamic patch weighting in ViTs
Has anyone explored weighting non-overlapping patches in images using ViTs? The weights would be learnable parameters. For instance, the background patches are sometimes useless for an image...
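A minimal sketch of one way this could look (PyTorch; module name, shapes, and the sigmoid gating are illustrative assumptions, not from the post): a small head predicts a weight per patch token and rescales the tokens before the transformer encoder.

```python
# Hedged sketch: content-dependent patch gating with learnable parameters.
import torch
import torch.nn as nn

class PatchGate(nn.Module):
    """Predicts a weight in (0, 1) per patch token and rescales the tokens."""
    def __init__(self, dim: int):
        super().__init__()
        self.score = nn.Linear(dim, 1)  # learnable gating parameters

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        # tokens: (batch, num_patches, dim)
        weights = torch.sigmoid(self.score(tokens))  # (batch, num_patches, 1)
        return tokens * weights                      # down-weight e.g. background patches

# Usage with hypothetical ViT-B patch embeddings (14x14 patches of a 224x224 image)
tokens = torch.randn(2, 196, 768)
print(PatchGate(768)(tokens).shape)  # torch.Size([2, 196, 768])
```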
[P] [R] [D] I built a biomedical GNN + LLM pipeline (XplainMD) for...
Hi everyone, I'm an independent researcher and recently finished building XplainMD, an end-to-end explainable AI pipeline for biomedical knowledge graphs. It's designed to predict and explain multiple...
[D] Thoughts about ICASSP 2025
There were a lot of visa issues, so half of the poster boards were empty, and two sessions I attended were just videos playing. Why are there visa issues at conferences? I got my paper into CVPR 23, but...
[D] Is research on discrete sampling / MCMC useful in industry? Feeling unsure.
Hi all, I'm currently a second-year PhD student in CS at a top-20 school. My research focuses on discrete sampling — designing MCMC-based algorithms for inference and generation over discrete spaces. While...
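For readers unfamiliar with the area, a toy illustration of discrete-space MCMC (a random-walk Metropolis sampler over binary vectors; the energy function is an arbitrary assumption, not the poster's research):

```python
# Hedged illustration: Metropolis sampling over a discrete state space.
import random
import math

def energy(x):
    # Toy Ising-like energy: favor configurations where neighboring bits agree.
    return -sum(1.0 if x[i] == x[i + 1] else -1.0 for i in range(len(x) - 1))

def metropolis_discrete(dim=16, steps=5000, temp=1.0, seed=0):
    rng = random.Random(seed)
    x = [rng.randint(0, 1) for _ in range(dim)]
    for _ in range(steps):
        i = rng.randrange(dim)           # propose flipping one bit
        y = list(x)
        y[i] ^= 1
        delta = energy(y) - energy(x)
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            x = y                        # accept the proposal
    return x

print(metropolis_discrete())
```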
Previewing parquet directly from the OS [Discussion]
Hi! I've worked with Parquet for years at this point, and it's my favorite format by far for data work. Nothing beats it. It compresses super well, is fast as hell, maintains a schema, and doesn't corrupt...
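As a hedged aside, a minimal way to preview a Parquet file's schema and first rows without loading it fully, using pyarrow (a generic sketch, not the poster's tool):

```python
# Sketch: print schema and first rows of a Parquet file from the command line.
import sys
import pyarrow.parquet as pq

def preview(path: str, n: int = 10) -> None:
    pf = pq.ParquetFile(path)
    print(pf.schema_arrow)                       # column names and types
    first_batch = next(pf.iter_batches(batch_size=n))
    print(first_batch.to_pandas())               # first n rows, no full load

if __name__ == "__main__":
    preview(sys.argv[1])
```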
[P] A slop forensics toolkit for LLMs: computing over-represented lexical...
Releasing a few tools around LLM slop (over-represented words & phrases). It uses stylometric analysis to surface repetitive words & n-grams which occur more often in LLM output compared to...
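A rough sketch of the underlying idea (not the released toolkit itself): compare smoothed word frequencies in LLM output against a human-written reference corpus and rank by ratio.

```python
# Hedged sketch: surface words over-represented in LLM text vs. a human reference.
from collections import Counter
import re

def word_counts(text: str) -> Counter:
    return Counter(re.findall(r"[a-z']+", text.lower()))

def overrepresented(llm_text: str, human_text: str, top_k: int = 10, alpha: float = 1.0):
    llm, human = word_counts(llm_text), word_counts(human_text)
    llm_total, human_total = sum(llm.values()), sum(human.values())
    ratios = {
        w: ((llm[w] + alpha) / (llm_total + alpha)) /
           ((human[w] + alpha) / (human_total + alpha))
        for w in llm
    }
    return sorted(ratios.items(), key=lambda kv: kv[1], reverse=True)[:top_k]

print(overrepresented("we delve into a rich tapestry of insights, delve deeper",
                      "we look at some results and discuss what they mean"))
```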
[P] B200 vs H100 Benchmarks: Early Tests Show Up to 57% Faster Training...
We at Lightly AI recently got early access to Nvidia B200 GPUs in Europe and ran some independent benchmarks comparing them against H100s, focusing on computer vision model training workloads. We...
[D] Yann LeCun: Auto-Regressive LLMs Are Doomed
Yann LeCun at the Josiah Willard Gibbs Lecture (2025). Not sure who else agrees, but I think Yann LeCun raises an interesting point here. Curious to hear other opinions on this! Lecture link:...
[D] Need open-source TTS
For the past week I've been working on a TTS script. I need it to support multiple accents (English only) and to run on CPU rather than GPU, while keeping inference time as low as possible for...
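One hedged possibility, assuming Coqui TTS is acceptable (the model and speaker names below are assumptions and may differ across library versions); its multi-speaker English VITS model runs on CPU by default:

```python
# Hedged sketch: CPU synthesis with a multi-speaker English model via Coqui TTS.
from TTS.api import TTS

tts = TTS(model_name="tts_models/en/vctk/vits")   # multi-speaker English model (assumed name)
tts.tts_to_file(
    text="Testing CPU-only synthesis with different English accents.",
    speaker="p225",                               # VCTK speaker id; accent varies by speaker
    file_path="sample.wav",
)
```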
[Project] I created a crop generator that you might want to use.
Hello everyone, I created a Python-based crop generator that helps me with my image datasets: https://github.com/fegarza7/CropGenerator. I am training SDXL models to recognize features and concepts, and I...
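For context, a minimal random-crop sketch with Pillow (a generic illustration, not the linked CropGenerator):

```python
# Sketch: generate random square crops from a folder of images.
import random
from pathlib import Path
from PIL import Image

def random_crops(src_dir: str, dst_dir: str, size: int = 512, per_image: int = 4, seed: int = 0):
    rng = random.Random(seed)
    out = Path(dst_dir)
    out.mkdir(parents=True, exist_ok=True)
    for path in Path(src_dir).glob("*.jpg"):
        img = Image.open(path).convert("RGB")
        w, h = img.size
        if w < size or h < size:
            continue                                  # skip images smaller than the crop
        for i in range(per_image):
            left = rng.randint(0, w - size)
            top = rng.randint(0, h - size)
            crop = img.crop((left, top, left + size, top + size))
            crop.save(out / f"{path.stem}_crop{i}.jpg")

# random_crops("raw_images", "crops")
```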
[D] Anyone having experience working with GRF (Google Research Football)...
I'm facing severe issues while working with GRF and was wondering if someone experienced could guide me through them. submitted by /u/Anonymous_Life17
[P] Building a Classifier for Time Series Forecasting
Hey everyone! I want to build a classifier that can automatically select the best forecasting model for a given univariate time series, based on which one results in the lowest MAPE (Mean Absolute...
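A hedged sketch of one version of that recipe (the toy forecasters and synthetic data are assumptions): label each training series with the candidate model that achieves the lowest holdout MAPE, extract simple features, and fit a classifier to predict that label.

```python
# Sketch: model-selection classifier supervised by per-series lowest MAPE.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def mape(y_true, y_pred):
    return np.mean(np.abs((y_true - y_pred) / np.clip(np.abs(y_true), 1e-8, None)))

# Toy candidate forecasters standing in for real models (e.g. ETS, ARIMA).
def naive_forecast(train, horizon):
    return np.repeat(train[-1], horizon)

def mean_forecast(train, horizon):
    return np.repeat(train.mean(), horizon)

CANDIDATES = [naive_forecast, mean_forecast]

def features(series):
    return [series.mean(), series.std(), np.corrcoef(series[:-1], series[1:])[0, 1]]

def best_model_label(series, horizon=12):
    train, test = series[:-horizon], series[-horizon:]
    errors = [mape(test, f(train, horizon)) for f in CANDIDATES]
    return int(np.argmin(errors))                      # index of lowest-MAPE model

rng = np.random.default_rng(0)
dataset = [rng.normal(size=100).cumsum() + 50 for _ in range(200)]   # synthetic series
X = np.array([features(s) for s in dataset])
y = np.array([best_model_label(s) for s in dataset])
clf = RandomForestClassifier(random_state=0).fit(X, y)
print(clf.predict(X[:3]))
```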
[P] Sub-2s cold starts for 13B+ LLMs + 50+ models per GPU — curious how...
We’re experimenting with an AI-native runtime that snapshot-loads LLMs (e.g., 13B–65B) in under 2–5 seconds and dynamically runs 50+ models per GPU — without keeping them always resident in...
[P] We built an OS-like runtime for LLMs — curious if anyone else is doing...
We’re experimenting with an AI-native runtime that snapshot-loads LLMs (e.g., 13B–65B) in under 2–5 seconds and dynamically runs 50+ models per GPU — without keeping them always resident in...
[D] Fine-tuned BART for product title & category normalization – still not...
Hi everyone, I’m building a price comparison website for products from various online stores in Moldova. I fine-tuned a BART model on a custom dataset of around 20,000 manually normalized product...
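For reference, a minimal inference sketch with Hugging Face transformers (facebook/bart-base stands in for the poster's fine-tuned checkpoint; the example title is invented):

```python
# Sketch: normalize a raw product title with a fine-tuned BART seq2seq model.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

checkpoint = "facebook/bart-base"  # replace with the fine-tuned checkpoint path
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

raw_title = "Telefon mobil Samsung Galaxy A54 128gb dual sim negru"
inputs = tokenizer(raw_title, return_tensors="pt", truncation=True)
outputs = model.generate(**inputs, max_new_tokens=64, num_beams=4)  # beam search for cleaner titles
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```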
[D] Adding new vocab tokens + fine-tuning LLMs to follow instructions is...
I've been experimenting with instruction-tuning LLMs and VLMs, either adding new specialized tokens to their corresponding tokenizer/processor or not. The setup is typical: mask the...
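A minimal sketch of the token-addition side of that setup (a common pattern, assumed here rather than quoted from the post): register the new special tokens and resize the embedding matrix before fine-tuning.

```python
# Sketch: add specialized tokens and resize embeddings before instruction-tuning.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "gpt2"  # placeholder; the post does not name a model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

new_tokens = ["<tool_call>", "<tool_result>"]           # hypothetical specialized tokens
num_added = tokenizer.add_special_tokens({"additional_special_tokens": new_tokens})
model.resize_token_embeddings(len(tokenizer))           # new rows are randomly initialized

print(f"Added {num_added} tokens; vocab size is now {len(tokenizer)}")
```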
[P] A lightweight open-source model for generating manga
I posted this on r/StableDiffusion (see some nice discussion) and someone recommended it'd also fit here. TL;DR: I finetuned Pixart-Sigma on 20 million manga images, and I'm making the model weights...
[R] CAT: Circular-Convolutional Attention for Sub-Quadratic Transformers
https://arxiv.org/abs/2504.06704 CAT achieves O(N log N) computation, requires fewer learnable parameters by streamlining fully-connected layers, and introduces no heavier operations, resulting in...
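A hedged illustration of the core primitive rather than the paper's implementation: a circular convolution along the sequence dimension computed with FFTs, which is where an O(N log N) cost comes from.

```python
# Sketch: circular convolution over the sequence dimension via FFT, O(N log N).
import torch

def circular_convolution(x: torch.Tensor, kernel: torch.Tensor) -> torch.Tensor:
    # x: (batch, seq_len, dim), kernel: (seq_len, dim) — circular mixing along seq_len.
    n = x.shape[1]
    x_f = torch.fft.rfft(x, n=n, dim=1)               # FFT along the sequence axis
    k_f = torch.fft.rfft(kernel, n=n, dim=0)
    return torch.fft.irfft(x_f * k_f.unsqueeze(0), n=n, dim=1)

x = torch.randn(2, 128, 64)
kernel = torch.randn(128, 64)                          # one filter per channel (assumed learnable)
print(circular_convolution(x, kernel).shape)           # torch.Size([2, 128, 64])
```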
[P] What if you could run 50+ LLMs per GPU — without keeping them in memory?
We've been experimenting with an AI-native runtime that snapshot-loads LLMs (13B–65B) in 2–5 seconds and dynamically runs 50+ models per GPU — without keeping them always resident in memory. Instead of...
[D] Will traditional machine learning algorithms (such as neural nets,...
Dear colleagues, I'm curious to hear from practitioners across industries about how large language models (LLMs) are reshaping your roles and evolving your workflows. Below, I've outlined a few emerging...
[P] Simple standalone TFRecords dataset reader with Random Access and...
Hi, at work we use TFRecords to store most of our datasets. However, from time to time we need to inspect the data to better understand our models' predictions, e.g. to find examples of...
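For comparison, the standard TensorFlow way to iterate and parse a TFRecord file for inspection (feature names below are placeholder assumptions, not the poster's schema):

```python
# Sketch: read and parse the first few records of a TFRecord file.
import tensorflow as tf

feature_spec = {
    "image": tf.io.FixedLenFeature([], tf.string),   # assumed serialized image bytes
    "label": tf.io.FixedLenFeature([], tf.int64),
}

def inspect_records(path: str, n: int = 5) -> None:
    dataset = tf.data.TFRecordDataset(path)
    for raw in dataset.take(n):
        example = tf.io.parse_single_example(raw, feature_spec)
        print(int(example["label"]), len(example["image"].numpy()), "bytes")

# inspect_records("train-00000-of-00010.tfrecord")
```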
[P] Harmonic Activations: Periodic and Monotonic Function Extensions for...
Hey folks! I’ve recently released a preprint proposing a new family of activation functions designed for normalization-free deep networks. I’m an independent researcher working on expressive...
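For context only, an existing example of a periodic yet monotonic activation, the Snake function x + sin²(ax)/a; this is not the preprint's proposed family, just an illustration of the kind of function involved.

```python
# Sketch: Snake activation with a learnable frequency parameter.
import torch
import torch.nn as nn

class Snake(nn.Module):
    def __init__(self, alpha: float = 1.0):
        super().__init__()
        self.alpha = nn.Parameter(torch.tensor(alpha))  # learnable frequency

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Derivative is 1 + sin(2*alpha*x) >= 0, so the function is monotone
        # non-decreasing while its slope oscillates periodically.
        return x + torch.sin(self.alpha * x) ** 2 / self.alpha

x = torch.linspace(-3, 3, 7)
print(Snake()(x))
```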
[D] “Reasoning Models Don’t Always Say What They Think” – Anyone Got Prompts?
Has anyone here tried replicating the results from the “Reasoning Models Don’t Always Say What They Think” paper using their own prompts? I'm working on reproducing these outputs. If you’ve...
[R] d1: Scaling Reasoning in Diffusion Large Language Models via...
Recent large language models (LLMs) have demonstrated strong reasoning capabilities that benefit from online reinforcement learning (RL). These capabilities have primarily been demonstrated within the...
[N] Google open to letting enterprises self-host SOTA models
From a major player, this sounds like a big shift and would mostly offer enterprises an interesting perspective on data privacy. Mistral is already doing this a lot while OpenAI and Anthropic maintain...