forked from JabRef/jabref
-
Notifications
You must be signed in to change notification settings - Fork 15
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Implimented arXivId Parsing for PDF with arXivId (JabRef#12335)
* Implimented arXivId Parsing forPDF with arXivId * added Optional parameter * merged fixes * removed csl-styles * fixed null arxiv issue on external imports * Improved getArxivId Implementation * reduced nesting and added arxiv constant * reduced nesting * modified testcase * fix abbrev repo * removed unnecessary 'if' clause * Use EPRINTTYPE Co-authored-by: Carl Christian Snethlage <[email protected]> Co-authored-by: Christoph <[email protected]> Co-authored-by: Ruslan <[email protected]> Co-authored-by: Subhramit Basu Bhowmick <[email protected]> * Update src/main/java/org/jabref/logic/importer/fileformat/PdfContentImporter.java Co-authored-by: Ruslan <[email protected]> * WIP Co-authored-by: Carl Christian Snethlage <[email protected]> Co-authored-by: Christoph <[email protected]> Co-authored-by: Ruslan <[email protected]> Co-authored-by: Subhramit Basu Bhowmick <[email protected]> * Add completion using arXiv ID Co-authored-by: Carl Christian Snethlage <[email protected]> Co-authored-by: Christoph <[email protected]> Co-authored-by: Ruslan <[email protected]> Co-authored-by: Subhramit Basu Bhowmick <[email protected]> * Adapt test case Co-authored-by: Carl Christian Snethlage <[email protected]> Co-authored-by: Christoph <[email protected]> Co-authored-by: Ruslan <[email protected]> Co-authored-by: Subhramit Basu Bhowmick <[email protected]> * Fix test name * "Fix2 number and year extraction for arXiv --------- Co-authored-by: Siedlerchr <[email protected]> Co-authored-by: Oliver Kopp <[email protected]> Co-authored-by: Carl Christian Snethlage <[email protected]> Co-authored-by: Ruslan <[email protected]> Co-authored-by: Subhramit Basu Bhowmick <[email protected]>
- Loading branch information
1 parent
add35ab
commit 4aa313d
Showing
4 changed files
with
83 additions
and
2 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -129,6 +129,44 @@ British Journal of Nutrition (2008), 99, 1–11 doi: 10.1017/S0007114507795296 | |
assertEquals(Optional.of(entry), importer.getEntryFromPDFContent(firstPageContent, "\n", Optional.empty())); | ||
} | ||
|
||
@Test | ||
void extractArXivFromPage() { | ||
BibEntry entry = new BibEntry(StandardEntryType.TechReport) | ||
.withField(StandardField.AUTHOR, "Filippo Riccaa and Alessandro Marchettob and Andrea Stoccoc") | ||
.withField(StandardField.TITLE, "A Multi-Year Grey Literature Review on AI-assisted Test Automation") | ||
.withField(StandardField.EPRINT, "2408.06224v1") | ||
.withField(StandardField.EPRINTTYPE, "arXiv") | ||
.withField((StandardField.KEYWORDS), "Test Automation Artificial Intelligence AI-assisted Test Automation Grey Literature Automated Test Generation Self-Healing Test Scripts"); | ||
|
||
// This is from https://arxiv.org/abs/2408.06224 | ||
String firstPageContent = """ | ||
A Multi-Year Grey Literature Review on AI-assisted Test Automation | ||
Filippo Riccaa, Alessandro Marchettob and Andrea Stoccoc | ||
aUniversity of Genoa, Via Balbi 5, Genova, 16126, Italy | ||
bUniversity of Trento, Via Sommarive 9, Trento, 38123, Italy | ||
cTechnical University of Munich, Boltzmannstraße 3, Munich, 85748, Germany | ||
dfortiss GmbH, Guerickestraße 25, Munich, 80805, Germany | ||
Keywords: | ||
Test Automation | ||
Artificial Intelligence | ||
AI-assisted Test Automation | ||
Grey Literature | ||
Automated Test Generation | ||
Self-Healing Test Scripts | ||
*Corresponding author | ||
[email protected] (F. Ricca) | ||
https://person.dibris.unige.it/ricca-filippo/ (F. Ricca) | ||
ORCID(s): 0000-0002-3928-5408 (F. Ricca); 0000-0002-6833-896X (A. Marchetto); 0000-0001-8956-3894 (A. Stocco) | ||
arXiv:2408.06224v1 [cs.SE] 12 Aug 2024"""; | ||
|
||
assertEquals(Optional.of(entry), importer.getEntryFromPDFContent(firstPageContent, "\n", Optional.empty())); | ||
} | ||
|
||
@ParameterizedTest | ||
@MethodSource("providePdfData") | ||
void pdfTitleExtraction(String expectedTitle, String filePath) throws Exception { | ||
|