Paper page - A Static Evaluation of Code Completion by Large Language ...
Underline | A Static Evaluation of Code Completion by Large Language Models
[2306.03203] A Static Evaluation of Code Completion by Large Language ...
Table 1 from A Static Evaluation of Code Completion by Large Language ...
A Static Evaluation of Code Completion by Large Language Models | DeepAI
Figure 1 from A Static Evaluation of Code Completion by Large Language ...
Table 3 from A Static Evaluation of Code Completion by Large Language ...
Paper page - A Systematic Evaluation of Large Language Models of Code
The Program Testing Ability of Large Language Models for Code - ACL ...
(PDF) An Empirical Evaluation of Large Language Models in Static Code ...
ICE-Score: Instructing Large Language Models to Evaluate Code - ACL ...
CodeJudge: Evaluating Code Generation with Large Language Models - ACL ...
Evaluating the Performance of Large Language Models via Debates - ACL ...
Personality-Guided Code Generation Using Large Language Models - ACL ...
On Improving Repository-Level Code QA for Large Language Models - ACL ...
(PDF) A Systematic Survey on Large Language Models for Static Code Analysis
Analyzing the Performance of Large Language Models on Code ...
A Closer Look into Using Large Language Models for Automatic Evaluation ...
Large Language Models Meet NL2Code: A Survey - ACL Anthology
Evaluating Large Language Models on Controlled Generation Tasks - ACL ...
Aligning Large Language Models for Controllable Recommendations - ACL ...
DCE-LLM: Dead Code Elimination with Large Language Models - ACL Anthology
Exploring the Potential of Large Language Models in Generating Code ...
Evaluating the Long-Term Memory of Large Language Models - ACL Anthology
Mitigating the Bias of Large Language Model Evaluation - ACL Anthology
A Closer Look Into Automatic Evaluation Using Large Language Models ...
Fine-tuning Language Models for Joint Rewriting and Completion of Code ...
Calibrating Large Language Models Using Their Generations Only - ACL ...
A Survey of using Large Language Models for Generating Infrastructure ...
Continual Learning of Large Language Models - ACL Anthology
Better Language Models of Code through Self-Improvement - ACL Anthology
Large Language Models in Bioinformatics: A Survey - ACL Anthology
Optimizing Large Language Models for OpenAPI Code Completion | AI ...
Empowering Large Language Models for Textual Data Augmentation - ACL ...
Interactive Evaluation of Large Language Models for Multi-Requirement ...
Explanation in the Era of Large Language Models - ACL Anthology
SafetyBench: Evaluating the Safety of Large Language Models - ACL Anthology
ConCodeEval: Evaluating Large Language Models for Code Constraints in ...
(PDF) Evaluating and Explaining Large Language Models for Code Using ...
Benchmarking Generation and Evaluation Capabilities of Large Language ...
(PDF) A Survey on Evaluating Large Language Models in Code Generation Tasks
Can You Really Trust Code Copilot? Evaluating Large Language Models ...
Anchor-based Large Language Models - ACL Anthology
Automated Creativity Evaluation for Large Language Models: A Reference ...
InstructCoder: Instruction Tuning Large Language Models for Code ...
Large Language Models with Controllable Working Memory - ACL Anthology
Evaluating Large Language Models with Enterprise Benchmarks - ACL Anthology
Figure 1 from Code completion with statistical language models ...
CodeJudge-Eval: Can Large Language Models be Good Judges in Code ...
VisualCoder: Guiding Large Language Models in Code Execution with Fine ...
A Comprehensive Review of Large Language Models for.pptx
LongCoder: A Long-Range Pre-trained Language Model for Code Completion ...
Challenging Large Language Models with New Tasks: A Study on their ...
How Abilities in Large Language Models are Affected by Supervised Fine ...
Robust and Scalable Model Editing for Large Language Models - ACL Anthology
Exploring the Reliability of Large Language Models as Customized ...
A Survey of Confidence Estimation and Calibration in Large Language ...
ReACC: A Retrieval-Augmented Code Completion Framework - ACL Anthology
Improving and Assessing the Fidelity of Large Language Models Alignment ...
Enhancing Semantic Consistency of Large Language Models through Model ...
Code Execution with Pre-trained Language Models - ACL Anthology
CI/CD evaluation of Large Language Models using OpenEvals
Exploring the Potential of Large Language Models in Computational ...
Evaluating Large Language Models Trained on Code - 知乎
[PDF] Evaluating Large Language Models Trained on Code | Semantic Scholar
TAIL: A Toolkit for Automatic and Realistic Long-Context Large Language ...
Characterizing the Confidence of Large Language Model-Based Automatic ...
(PDF) Evaluating Large Language Models Trained on Code
(PDF) Evaluating Large Language Models for Functional and Maintainable ...
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task ...
Rewriting the Code: A Simple Method for Large Language Model Augmented ...
(PDF) Large Language Models for Code Summarization
Filter-then-Generate: Large Language Models with Structure-Text Adapter ...
Large Language Models Can Self-Correct with Key Condition Verification ...
LongReward: Improving Long-context Large Language Models with AI ...
WaveCoder: Widespread And Versatile Enhancement For Code Large Language ...
DocChecker: Bootstrapping Code Large Language Model for Detecting and ...
Towards Better Value Principles for Large Language Model Alignment: A ...
Large Language Models Are State-of-the-Art Evaluator for Grammatical ...
Large Language Models for Code: Security Hardening and Adversarial ...
Aligning Large Language Models to Follow Instructions and Hallucinate ...
(PDF) Improving Code Completion by Solving Data Inconsistencies in the ...
Evaluating Large Language Models on Wikipedia-Style Survey Generation ...
Enhancing Large Language Models in Coding Through Multi-Perspective ...
A Survey on Efficient Large Language Model Training: From Data-centric ...
(PDF) Towards Full-line Code Completion with Neural Language Models
Exploring Large Language Models for Knowledge Graph Completion | DeepAI
Large Language Models Can be Lazy Learners: Analyze Shortcuts in In ...
Multi-perspective Improvement of Knowledge Graph Completion with Large ...
PPTC-R benchmark: Towards Evaluating the Robustness of Large Language ...
[논문 리뷰] Evaluating Large Language Models for Code Review
Multilingual Knowledge Graph Completion from Pretrained Language Models ...
Enhancing Large Language Model for Knowledge Graph Completion via ...
Evaluating Large Language Models as Generative User Simulators for ...
Large Language Models Can Learn Representation in Natural Language ...
CodeReviewQA: The Code Review Comprehension Assessment for Large ...
Evaluating Large Language Models Benchmarks & Challenges
Evaluating Large Language Models: Methods, Best Practices & Tools ...
CodeAttack: Revealing Safety Generalization Challenges of Large ...
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large ...
CodeIF: Benchmarking the Instruction-Following Capabilities of Large ...
KICGPT: Large Language Model with Knowledge in Context for Knowledge ...
L-Eval: Instituting Standardized Evaluation for Long Context Language ...
Benchmarking Large Language Model Capabilities for Conditional ...
Large Language Model Evaluation in 2026: Technical Methods & Tips
Sequence-level Large Language Model Training with Contrastive ...
OOP: Object-Oriented Programming Evaluation Benchmark for Large ...
Towards Explainable Computerized Adaptive Testing with Large Language ...
Large Language Model Evaluation in '25: 5 Methods
Comparison of Large Language Models: The Ultimate Guide
What are Large Language Models and How They Work: Explained!
Large Language Model Structure - Image to u
Large Language Model as an Assignment Evaluator: Insights, Feedback ...
Benchmarking and Improving Long-Text Translation with Large Language ...
Automatically Correcting Large Language Models: Surveying the Landscape ...
(PDF) Combining Program Analysis and Statistical Language Model for ...
KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on ...
(PDF) R2C2-Coder: Enhancing and Benchmarking Real-world Repository ...