Abstract
Repository mining research is a data-intensive domain with a focus on source code. There are many ways to search for code in the worldwide software ecosystem, but these search methods are inefficient and only cover small parts of the software ecosystem. One of the problems is granularity: it is possible to search through code on a file-level and cover a significant part of the software ecosystem or search for a line of code and only cover a small part of the software ecosystem, but not both. We propose SearchSECO: a language-agnostic search engine and research platform that searches through abstract representations of source code methods. We use SearchSECO to search across the worldwide software ecosystem and index the encountered methods. With SearchSECO, the field is advanced because it (1) provides finer-grained and more efficient searches, (2) covers more of the software ecosystem than other search mechanisms, and (3) provides mechanisms for source code provenance.
Original language | English |
---|---|
Journal | CEUR Workshop Proceedings |
Volume | 2912 |
Publication status | Published - 2020 |
Event | 19th Belgium-Netherlands Software Evolution Workshop, BENEVOL 2020 - Luxembourg, Luxembourg Duration: 3 Dec 2020 → 4 Dec 2020 |
Bibliographical note
Funding Information:ACKNOWLEDGMENT The SearchSECO project, which is part of the SecureSECO initiative, is currently supported by Gitlab, the Software Improvement Group (SIG), the Lisk Center, Cisco, Centric, and KPN Security. See SecureSECO.org/partners for more information. SecureSECO is endorsed by the Secure Software Alliance, the Vereniging Software Engineering Nederland (Versen.nl), and the Blockchain Coalition. We are also in contact with the eSciences center, SurfSARA, and DANS, whom we see as strategic partners in designing our computing and data storage resources.
Funding Information:
The SearchSECO project, which is part of the SecureSECO initiative, is currently supported by Gitlab, the Software Improvement Group (SIG), the Lisk Center, Cisco, Centric, and KPN Security. See SecureSECO.org/partners for more information. SecureSECO is endorsed by the Secure Software Alliance, the Vereniging Software Engineering Nederland (Versen.nl), and the Blockchain Coalition. We are also in contact with the eSciences center, SurfSARA, and DANS, whom we see as strategic partners in designing our computing and data storage resources. We would like to thank all the SecureSECO student team members for their ideas in this paper: Venja Beck, Floris Jansen, Fang Hou, Elena Baninemeh, Luuk van Driel, Jozef Siu, Swayam Shah, Donny Groeneveld, and Tom Peirs.
Publisher Copyright:
Copyright 2020 for this paper by its authors.
Funding
ACKNOWLEDGMENT The SearchSECO project, which is part of the SecureSECO initiative, is currently supported by Gitlab, the Software Improvement Group (SIG), the Lisk Center, Cisco, Centric, and KPN Security. See SecureSECO.org/partners for more information. SecureSECO is endorsed by the Secure Software Alliance, the Vereniging Software Engineering Nederland (Versen.nl), and the Blockchain Coalition. We are also in contact with the eSciences center, SurfSARA, and DANS, whom we see as strategic partners in designing our computing and data storage resources. The SearchSECO project, which is part of the SecureSECO initiative, is currently supported by Gitlab, the Software Improvement Group (SIG), the Lisk Center, Cisco, Centric, and KPN Security. See SecureSECO.org/partners for more information. SecureSECO is endorsed by the Secure Software Alliance, the Vereniging Software Engineering Nederland (Versen.nl), and the Blockchain Coalition. We are also in contact with the eSciences center, SurfSARA, and DANS, whom we see as strategic partners in designing our computing and data storage resources. We would like to thank all the SecureSECO student team members for their ideas in this paper: Venja Beck, Floris Jansen, Fang Hou, Elena Baninemeh, Luuk van Driel, Jozef Siu, Swayam Shah, Donny Groeneveld, and Tom Peirs.