-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathDESCRIPTION
39 lines (39 loc) · 1.26 KB
/
DESCRIPTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
Package: c4meta
Type: Package
Title: Metadata About the Colossal Clean Crawled Corpus
Version: 0.0.0.9000
Authors@R: c(
person(given = "Jon",
family = "Harmon",
role = c("aut", "cre"),
email = "[email protected]",
comment = c(ORCID = "0000-0003-4781-4346")),
person(given = "Jonathan",
family = "Bratt",
role = c("aut"),
email = "[email protected]",
comment = c(ORCID = "0000-0003-2859-0076")),
person(given = "Bedford Freeman & Worth Pub Grp LLC DBA Macmillan Learning",
role = c("cph")),
person(given = "AllenAI", role = c("cph"),
comment = "The processed C4 dataset.")
)
Description: Google's Colossal Clean Crawled Corpus (C4) is an extremely large
corpus of text scraped from the internet. This package provides analyses
of the text contained in C4.
License: GPL (>= 3)
Encoding: UTF-8
LazyData: true
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.1.1
Suggests:
knitr,
piecemaker,
rmarkdown,
testthat (>= 3.0.0)
Config/testthat/edition: 3
Depends:
R (>= 2.10)
URL: https://github.com/macmillancontentscience/c4meta
BugReports: https://github.com/macmillancontentscience/c4meta/issues
VignetteBuilder: knitr