Channel: Machine Learning

[D] Monthly Who's Hiring and Who wants to be Hired?

For job postings, please use this template: Hiring: [Location], Salary: [], [Remote | Relocation], [Full Time | Contract | Part Time] and [Brief overview, what you're looking for]. For those looking for jobs...



[D] Simple Questions Thread

Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead! Thread will stay alive until next one so keep posting after...


[R] Never Train from Scratch

https://arxiv.org/pdf/2310.02980 The authors show that pretrained transformers can match the performance of S4 on the Long Range Arena benchmark. Submitted by /u/Whatever_635.


[D] I think I figured out how to build General Intelligence. Want to get some...

I advise you to read this post in the following manner; it is more understandable and more fun that way. Copy the content of this post and paste it into your favorite LLM (preferably Claude 3.5 Sonnet). Just the text, without any...


[R] Help with CNN-RNN Architecture for Self-Supervised Matrix Completion

Hi all, I’m working on a self-supervised learning approach to estimate missing or uncertain data in a freeway traffic density dataset, inspired by matrix completion methods. The dataset is generated...



[D] Can we transfer language capabilities of one LLM to another?

I have seen techniques that let one model effectively teach another model its unique capability or domain knowledge. But can this be made possible for language capability as well? For example, if we...


[D] Need Advice Starting my Recommendation Engine Project for my Employer

Title sums it up. I'm mostly familiar with time series prediction models, as that's what I've spent most of my time building (I'm a data analyst that's recently built some cool ML stuff). But I need to...


[D] Which LLM do you use for analysing Financials, P&Ls, Balance Sheets?

If any of you have tried different LLMs, I am super curious which one you found works great for analysing financials, P&Ls, and balance sheets for a company. I am looking to use it regularly so it'd be...



[D] Genuine Question: Why do people want to run local LLMs?

Since the new models (o1, 4o, Claude, for example) are so powerful and have relatively low subscription and API costs, what would justify someone today trying to install limited local LLM models of up...



[D] Autograd vs JAX? Both are Google products aimed at gradient based...

Just recently saw Autograd (the library) by Google people, which thinly wraps NumPy to offer backprop. JAX also does this but basically rewrites NumPy. What’s the difference? Is it the GPU/TPU support of JAX?...


[D] [R] Problems understanding DSP-like pipelines

I'd like to hear your opinion on this new paradigm of interacting with LLMs. In particular, I'm talking about "simple" stuff like Reflection (like Self-refine and Reflexion), up to more complex stuff...


[D] Inference time as a function of the number of tokens when using Flash...

Hello, I'm looking for a graph illustrating the inference time of language models with Flash Attention across different numbers of tokens. I looked for such a comparison on the internet but found...


[D] International masters student in the US, should I take up research or...

Hi everyone, I’m an international student from a developing country currently pursuing my master’s degree in the U.S. I’m working in a research lab, which is interesting, but my goal is to get a job...



[D] Struggling with Autoencoder-Based Anomaly Detection for Fraud Detection –...

Hey everyone! 👋 I’m currently working on training an Autoencoder for anomaly detection in fraudulent card transactions, but I’m hitting a roadblock. The performance has been underwhelming, with a...


[D] Player identification and tracking in basketball videos (computer vision)

I'm starting a project for a client in the sports industry where the goal is to identify basketball players from videos. Specifically, they want the player number; this is just the first step...



[D] How to run a Federated Learning simulation on a custom dataset where I...

So I was looking at flwr for this task and found a lot of partitioners, but none of them could get the job done (I could be missing something too). Have you guys tackled such a problem? For a better understanding,...


[P] YOLOv8 .pt File for General Object Detection Across Multiple Environments...

Could someone provide the best possible .pt file for YOLOv8 for general object detection, covering environments like colleges, offices, and homes, with a dataset containing at least 50 classes?...



[D] What techniques can I use to maintain uniformity in image generation

I am working on an NLP project which 1) takes a txt file as input, 2) extracts information in a pre-defined writeup using the Gemini API, 3) uses DistilBERT to summarise the main file, 4) and using ROUGE with...


[P] Open Source Modular Tool For LLM Reverse Engineering and Red Teaming

https://github.com/user1342/Oversight (submitted by /u/OppositeMonday)


[D] Get papers peer-reviewed and published quickly

Hi! I have some work that I would like to get peer-reviewed and published. I'm not aiming for a top journal; I'm looking for options where the publication process is relatively fast. Do you have any...


[D] On obscurities and missed links with Normalizations

Although they are used almost everywhere, I keep noticing how obscure normalization techniques are, possibly to both redditors and technicians. InstanceNorm, GroupNorm, BatchNorm, LayerNorm are all computing...



[D] Evolving Matrix Computation Techniques for Modern AI: What's New?

As AI models continue to scale in both complexity and size, I'm interested in how the field of matrix computations is evolving to meet these new challenges. What are some of the latest advancements or...



[P] I made a tool for building and training neural networks visually,...

Hey! I mostly made this as a tool to learn how to implement backpropagation and get some intuition on how it works, so I figure it might be useful for someone else! I also wrote up an article in the...



[R] Amazon Researchers Find LLMs do not always follow User Requests and...

Came across this interesting paper being presented next week at EMNLP 2024: LLM Self-Correction with DECRIM: DECOMPOSE, CRITIQUE, AND REFINE for Enhanced Following of Instructions with Multiple...


[D] As a researcher, how do you become industry-ready?

Being a PhD student, much of my time is spent on supervising students, project management and writing "quick and dirty" code for prototyping. I intend to move to industry after the PhD, but I feel like...



[R] Embedding models are not able to capture neutral semantics: applied to a...

In this paper, On Debiasing Text Embeddings Through Context Injection, it is shown that embedding models are not able to truly capture the semantics of "neutral" text. This phenomenon is studied under...


[D] Want to move away from coding heavy ML but still want to complete the PhD

Hi Folks, I come from a traditional electrical engineering background, doing things like industrial automation and computer vision. I decided to pursue a PhD in ML as I thought it would be a good field to...
