Large Language Models as Optimizers
Paper
• 2309.03409
• Published • 79
FLM-101B: An Open LLM and How to Train It with $100K Budget
Paper
• 2309.03852
• Published • 45
GPT Can Solve Mathematical Problems Without a Calculator
Paper
• 2309.03241
• Published • 19
DoLa: Decoding by Contrasting Layers Improves Factuality in Large
Language Models
Paper
• 2309.03883
• Published • 36
ImageBind-LLM: Multi-modality Instruction Tuning
Paper
• 2309.03905
• Published • 17
Textbooks Are All You Need II: phi-1.5 technical report
Paper
• 2309.05463
• Published • 90
NExT-GPT: Any-to-Any Multimodal LLM
Paper
• 2309.05519
• Published • 79
When Less is More: Investigating Data Pruning for Pretraining LLMs at
Scale
Paper
• 2309.04564
• Published • 17
Efficient Memory Management for Large Language Model Serving with
PagedAttention
Paper
• 2309.06180
• Published • 50
Language Modeling Is Compression
Paper
• 2309.10668
• Published • 85
Multimodal Foundation Models: From Specialists to General-Purpose
Assistants
Paper
• 2309.10020
• Published • 41