A study on robustness and reliability of large language model code generationarxiv.org176 pointsfloridsleeves3 years ago