Incomplete source code naturally emerges in software development: during design phase, while evolving, testing and analyzing programs. Therefore, the ability to understand partial programs is a valuable asset. However, this problem is still unsolved in the C programming language. Difficulties stem from the fact that parsing C requires, not only syntax, but also semantic information. Furthermore, inferring types so that they respect C’s type system is a challenging task. In this paper we present a technique the lets us solve these problems. We provide a unification-based type inference capable of dealing with C intricacies. The ideas we present let us reconstruct partial C programs into complete well-typed ones. Such program reconstruction has several applications: enabling static-analysis tools in scenarios where software components may be absent; improving static-analysis tools that do not rely on build-specifications; allowing stub-generation and testing tools to work on snippets; and assisting programmers on the extraction of reusable data-structures out of the program parts that use them. Our evaluation is performed on source code from a variety of C libraries such as GNU’s Coreutils, GNULib, GNOME’s GLib, GDSL, and igraph; on implementations from Sedgewick’s books; and on snippets from popular open-source projects like CPython, FreeBSD, and Git.
Thu 11 Jan Times are displayed in time zone: (GMT-07:00) Tijuana, Baja California change
|10:30 - 10:55|
Leandro T. C. MeloUFMG, Rodrigo Geraldo RibeiroUFOP, Marcus Rodrigues de AraújoUFMG, Fernando Magno Quintão PereiraUFMGPre-print
|10:55 - 11:20|
|11:20 - 11:45|
|11:45 - 12:10|