pull_wos.Rd
pull_wos
wraps the process of querying, downloading, parsing, and
processing Web of Science data.
pull_wos(query, editions = c("SCI", "SSCI", "AHCI", "ISTP", "ISSHP", "BSCI", "BHCI", "IC", "CCR", "ESCI"), sid = auth(Sys.getenv("WOS_USERNAME"), Sys.getenv("WOS_PASSWORD")), ...)
query | Query string. See the WoS query documentation page for details on how to write a query as well as this list of example queries. |
---|---|
editions | Web of Science editions to query. Possible values are listed here. |
sid | Session identifier (SID). The default setting is to get a fresh
SID each time you query WoS via a call to |
... | Arguments passed along to |
A list of the following data frames:
A data frame where each row corresponds to a different
publication. Note that each publication has a distinct ut
. There is
a one-to-one relationship between a ut
and each of the columns
in this table.
A data frame where each row corresponds to a different
publication/author pair (i.e., a ut
/author_no
pair). In
other words, each row corresponds to a different author on a publication.
You can link the authors in this table to the address
and
author_address
tables to get their addresses (if they exist). See
example in FAQs for details.
A data frame where each row corresponds to a different
publication/address pair (i.e., a ut
/addr_no
pair). In
other words, each row corresponds to a different address on a publication.
You can link the addresses in this table to the author
and
author_address
tables to see which authors correspond to which
addresses. See example in FAQs for details.
A data frame that specifies which authors correspond
to which addresses on a given publication. This data frame is meant to
be used to link the author
and address
tables together.
A data frame where each row corresponds to a different
publication/jsc (journal subject category) pair. There is a many-to-many
relationship between ut
's and jsc
's.
A data frame where each row corresponds to a different publication/keyword pair. These are the author-assigned keywords.
A data frame where each row corresponds to a different publication/keywords_plus pair. These keywords are the keywords assigned by Clarivate Analytics through an automated process.
A data frame where each row corresponds to a different
publication/grant agency/grant ID triplet. Not all publications acknowledge
a specific grant number in the funding acknowledgement section, hence the
grant_id
field can be NA
.
A data frame where each row corresponds to a different publication/document type pair.
# NOT RUN { sid <- auth("your_username", password = "your_password") pull_wos("TS = (dog welfare) AND PY = 2010", sid = sid) # Re-use session ID. This is best practice to avoid throttling limits: pull_wos("TI = \"dog welfare\"", sid = sid) # Get fresh session ID: pull_wos("TI = \"pet welfare\"", sid = auth("your_username", "your_password")) # It's best to see how many records your query matches before actually # downloading the data. To do this, call query_wos before running pull_wos: query <- "TS = ((cadmium AND gill*) NOT Pisces)" query_wos(query, sid = sid) # shows that there are 1,611 matching publications pull_wos(query, sid = sid) # }