"[D]" Hotel Pricing Optimization Brainstorming
Reddit » Machine Learning
by /u/Due_Ad2638
34m ago
Hey everyone! I'm working on a project with a hotel that wants to use their data for pricing optimization, and I was curious to know what other ML engineers would do in this case. Let's say the hotel gives us all the data about: their competitors prices their favorite market placement within those competitors all nearby and internal events, including holidays (and their importance ranking from 0-100) weather historic internal data (sales, revenue) (own hotel for past 3 years) historic prices (own hotel and all competitors for past 3 years) metasearch data and reviews All this data has 2 col ..read more
Visit website
[P] Multihead Mixture of Experts - Implementation of dense subtoken routing suggested in https://arxiv.org/pdf/2404.15045
Reddit » Machine Learning
by /u/Prudent_Student2839
34m ago
My friend implemented the method of Multihead Mixture of Experts in this arxiv paper https://arxiv.org/pdf/2404.15045 and he wanted me to share it with you! Try it out. Let me know what you think and I will pass it on to him. submitted by /u/Prudent_Student2839 [visit reddit] [comments ..read more
Visit website
[D] HyenaDNA and Mamba are not good at sequential labelling ?
Reddit » Machine Learning
by /u/blooming17
2h ago
Hello guys, I've been working on a sequential labelling using DNA sequences as inputs. Lately there have been 2 foundation models released HyenaDNA (Based on Hyena operator) and Caduceus (based on mamba), I used both pretrained and from scratch models and performances are terrible even with pretrained ones. Does anyone have experience with this type of models, and what are the potential causes for performance drop ? I am literally getting zero performance for the minority class ? Does mamba deal poorly with class imbalance ? submitted by /u/blooming17 [visit reddit] [comments ..read more
Visit website
[D] Does anyone use Bedrock Agents for function calling?
Reddit » Machine Learning
by /u/raman_boom
2h ago
I have a use case to use function calling within my application, I am confused whether to choose OpenAI function calling or use Bedrock Agents coupled with Lambda functions for this, which is the best approach? Or help me to choose between these two. submitted by /u/raman_boom [visit reddit] [comments ..read more
Visit website
[D] What are your horror stories from being tasked impossible ML problems
Reddit » Machine Learning
by /u/LanchestersLaw
2h ago
ML is very good at solving a niche set of problems, but most of the technical nuances are lost on tech bros and managers. What are some problems you have been told to solve which would be impossible (no data, useless data, unrealistic expectations) or a misapplication of ML (can you have this LLM do all of out accounting). submitted by /u/LanchestersLaw [visit reddit] [comments ..read more
Visit website
Datasets for Causal ML [D]
Reddit » Machine Learning
by /u/Direct-Touch469
4h ago
Does anyone know what datasets are out there for causal inference? I’d like to explore methods in the doubly robust ML literature, and I’d like to compensate my learning by working on some datasets and learn the econML software. Does anyone know of any datasets, specifically in the context of marketing/pricing/advertising that would be good sources to apply causal inference techniques? I’m open to other datasets as well. submitted by /u/Direct-Touch469 [visit reddit] [comments ..read more
Visit website
[P] Dreamboothing MusicGen
Reddit » Machine Learning
by /u/Sufficient-Tennis189
4h ago
Dreambooth the MusicGen model suite on small consumer GPUs, in a matter of minutes, using this repository: https://github.com/ylacombe/musicgen-dreamboothing The aim of this project is to provide tools to easily fine-tune and dreambooth the MusicGen model suite, with little data and to leverage a series of optimizations and tricks to reduce resource consumption, thanks to LoRA adaptors. For example, the model can be fine-tuned on a particular music genre or artist to give a checkpoint that generates in that given style. The aim is also to easily share and build on these trained checkpoints, S ..read more
Visit website
[D] Transitioning from Operations Research Scientist to ML/AI/CV Engineer
Reddit » Machine Learning
by /u/unsuccessful_boy
6h ago
Hello fellow smart people on Reddit, recently I've been thinking about changing my job (I'm a Vision Engineer) and I stumbled upon this position on LinkedIn called Operations Research Scientist. I was wondering after a few years (maybe at most 2) of working in that position, will it be easier for me to transition to a Machine Learning/Artificial Intelligence Engineer or maybe a Computer Vision Engineer role? submitted by /u/unsuccessful_boy [visit reddit] [comments ..read more
Visit website
[R] Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
Reddit » Machine Learning
by /u/SeawaterFlows
6h ago
Paper: https://arxiv.org/abs/2404.08698 Abstract: While Large Language Models (LLMs) have shown remarkable abilities, they are hindered by significant resource consumption and considerable latency due to autoregressive processing. In this study, we introduce Adaptive N-gram Parallel Decoding (ANPD), an innovative and lossless approach that accelerates inference by allowing the simultaneous generation of multiple tokens. ANPD incorporates a two-stage approach: it begins with a rapid drafting phase that employs an N-gram module, which adapts based on the current interactive context, followed b ..read more
Visit website
[D] Old Paper - Troubling Trends in Machine Learning Scholarship
Reddit » Machine Learning
by /u/pyepyepie
6h ago
I just wanted to remind or introduce newcomers to this paper. I think this discussion should be re-opened since many people here actually do influence the trends of the field. https://arxiv.org/pdf/1807.03341 On a personal note (feel free to skip): Specifically, I want to point out the issue of "Mathiness", as it seems like this problem got way out of hand and most best papers of conferences suffer from it (one of the most important ML papers tried to be mathy and introduced a big mistake, I believe other papers have bigger issues but no one bothers to check it). So here are my personal point ..read more
Visit website

Follow Reddit » Machine Learning on FeedSpot

Continue with Google
Continue with Apple
OR