Ryutaro Nishizaka, Yudai Fujiwara, Takuya Shimizu, Kazushi Kato, Yuichi Sugiyama (Ricerca Security, Inc.)

LLM agents that autonomously operate tools such as disassemblers and debuggers are increasingly used for reverse engineering. Designing LLM-resistant protections requires understanding their capability characteristics, yet prior work has not studied this systematically. We propose an analytical model linking a three-stage loop (Observe–Comprehend–Plan) to three categories of software protection (Concealment–Complication– Misdirection) and evaluate three LLM agents on 24 CTF reverse engineering tasks. By analyzing failure logs, we identify four weaknesses (Training bias, Over-trust in observations, Context limitation, Plan persistence) and show that different software protections disrupt different stages and expose different weaknesses. We also find that LLM agents often analyze assembly effectively without a decompiler, and that their strengths differ from human solvers depending on challenge characteristics.

View More Papers

Huma: Censorship Circumvention via Web Protocol Tunneling with Deferred...

Sina Kamali (University of Waterloo), Diogo Barradas (University of Waterloo)

Read More

Auditable LLM Arbiter for DeFi Security: A Hybrid Graph-of-Thoughts...

Duanyi Yao (Navalabs), Siddhartha Jagannath (Navalabs), Baltasar Aroso (Navalabs), Vyas Krishnan (Navalabs), Ding Zhao (Navalabs)

Read More

Towards Parallel Binary Code Analysis

Xiaozhu Meng (University of Wisconsin-Madison)

Read More