Code Generation Using LLM

Formal Reasoning Meets LLMs: Toward AI for Mathematics and Verification

Print Join the Discussion View in the ACM Digital Library The mathematical reasoning performed by LLMs is fundamentally different from the rule-based symbolic methods in traditional formal reasoning.

Want local vibe coding? This AI stack replaces Claude Code and Codex - and it's free

And then there's agentic AI coding. When a tool can help you do four years of product development in four days, the impact is world-changing. While vibe coding has its detractors (for good reason), AI ...

InfoWorld

Building AI agents with the GitHub Copilot SDK

The GitHub Copilot SDK turns the Copilot CLI into a cross-platform agent host with Model Context Protocol support.

InfoWorld

AI use may speed code generation, but developers’ skills suffer

As companies move to more AI code writing, humans may not have the necessary skills to validate and debug the AI-written code if their skill formation was inhibited by using AI in the first place, ...

IEEE

RTLBench: A Multi-Dimensional Benchmark Suite for Evaluating LLM-Generated RTL Code

Abstract: The rapid advancement of large language models (LLMs) has enabled automated Register Transfer Level (RTL) code generation, accelerating chip design workflows. However, existing benchmarks ...

IEEE

Spec2RTL-Agent: Automated Hardware Code Generation from Complex Specifications Using LLM Agent Systems

Abstract: Despite recent progress in generating hardware register transfer level (RTL) code with large language models (LLMs), existing solutions still suffer from a substantial gap between practical ...

GitHub

DeepCode: Open Agentic Coding

We evaluate DeepCode on the PaperBench benchmark (released by OpenAI), a rigorous testbed requiring AI agents to independently reproduce 20 ICML 2024 papers from scratch. The benchmark comprises 8,316 ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results