Specializing Neural Networks for Cryptographic Code Completion Applications

Ya Xiao, Wenjia Song, Jingyuan Qi, Bimal Viswanath, Patrick McDaniel, Danfeng Yao

Research output: Contribution to journalArticlepeer-review

Abstract

Similarities between natural languages and programming languages have prompted researchers to apply neural network models to software problems, such as code generation and repair. However, program-specific characteristics pose unique prediction challenges that require the design of new and specialized neural network solutions. In this work, we identify new prediction challenges in application programming interface (API) completion tasks and find that existing solutions are unable to capture complex program dependencies in program semantics and structures. We design a new neural network model Multi-HyLSTM to overcome the newly identified challenges and comprehend complex dependencies between API calls. Our neural network is empowered with a specialized dataflow analysis to extract multiple global API dependence paths for neural network predictions. We evaluate Multi-HyLSTM on 64,478 Android Apps and predict 774,460 Java cryptographic API calls that are usually challenging for developers to use correctly. Our Multi-HyLSTM achieves an excellent top-1 API completion accuracy at 98.99%. Moreover, we show the effectiveness of our design choices through an ablation study and have released our dataset.

Original languageEnglish (US)
Pages (from-to)3524-3535
Number of pages12
JournalIEEE Transactions on Software Engineering
Volume49
Issue number6
DOIs
StatePublished - Jun 1 2023

All Science Journal Classification (ASJC) codes

  • Software

Fingerprint

Dive into the research topics of 'Specializing Neural Networks for Cryptographic Code Completion Applications'. Together they form a unique fingerprint.

Cite this