/r/MachineLearning

Year:

Only show posts with narrations

[R] How Well Can a Long Sequence Model Model Long Sequences? Comparing Architectural Inductive Biases on Long-Context Abilities

6 upvotes • StartledWatermelon

Mark as read: Add to a list

[D] Forum for Machine Learning

6 upvotes • satori_paper

Mark as read: Add to a list

[Project] - how to showcase reasoning for model missing prediction

6 upvotes • Environmental_Pop686

Mark as read: Add to a list

[D] Implementing the Progressive GAN

6 upvotes • throwaway16362718383

Mark as read: Add to a list

[D] Exploring SELF-ROUTE: A Hybrid Approach to Efficient Long Context Question-Answering

6 upvotes • Desperate-Homework-2

Mark as read: Add to a list

[D] Trying to replace Diffusion with Gradient Descent in Flux

6 upvotes • LahmacunBear

Mark as read: Add to a list

Large dataset of large files on ec2 [D]

5 upvotes • SuperbMonk4403

Mark as read: Add to a list

[R] Open Set Recognition SOTA

5 upvotes • Background_Camel_711

Mark as read: Add to a list

CoRL 2024 reviews [Discussion]

5 upvotes • oz_zey

Mark as read: Add to a list

[D] Help scaling LLM inference on Azure Kubernetes

5 upvotes • chulpichochos

Mark as read: Add to a list

Title	Upvotes	Author	Mark as read	Favorited	Rating	Add to a list
[R] How Well Can a Long Sequence Model Model Long Sequences? Comparing Architectural Inductive Biases on Long-Context Abilities	6	StartledWatermelon
[D] Forum for Machine Learning	6	satori_paper
[Project] - how to showcase reasoning for model missing prediction	6	Environmental_Pop686
[D] Implementing the Progressive GAN	6	throwaway16362718383
[D] Exploring SELF-ROUTE: A Hybrid Approach to Efficient Long Context Question-Answering	6	Desperate-Homework-2
[D] Trying to replace Diffusion with Gradient Descent in Flux	6	LahmacunBear
Large dataset of large files on ec2 [D]	5	SuperbMonk4403
[R] Open Set Recognition SOTA	5	Background_Camel_711
CoRL 2024 reviews [Discussion]	5	oz_zey
[D] Help scaling LLM inference on Azure Kubernetes	5	chulpichochos