BRAIn: Bayesian Reward-conditioned Amortized INference for natural language generation from feedbackGaurav PandeyYatin Nandwaniet al.2024ICML 2024
Few shot chain-of-thought driven reasoning to prompt LLMs for open-ended medical question answeringSaeel Sandeep NachaneOjas Gramopadhyeet al.2024EMNLP 2024