Agentic AI for Simulations Workflows
Vadim Elisseev, Robert Firth, et al.
SC 2025
Recent advances in large language models (LLMs) have led to substantial improvements in automatic code translation across programming languages. However, translating large or complex legacy code—such as monolithic COBOL programs—remains a challenge due to the limited context window of transformer-based models and their diminished ability to preserve long-range dependencies. In this work, we consider COBOL-to-Java translation as a case study and propose a hierarchical chunking strategy that decomposes long code segments into smaller, semantically coherent chunks, each fitting within a predefined Lines of Code (LoC) constraint. We formalize the problem of minimizing the number of chunks under an LoC constraint and present both a greedy heuristic and a dynamic programming based lower bound. Experiments on real-world COBOL applications demonstrate that our approach significantly improves the translation quality compared to non-chunked baselines.
Vadim Elisseev, Robert Firth, et al.
SC 2025
Chih-kai Ting, Karl Munson, et al.
AAAI 2023
Sahil Suneja, Yufan Zhuang, et al.
ACM TOSEM
Anamitra R. Choudhury, Ramachandra Kota, et al.
e-Energy 2016