troopdata
:
Tools for Analyzing Cross-National Military Deployment and Basing
DataThe goal of the {troopdata}
package is to facilitate the
distribution of military deployment and basing data for use in social
science research and journalism. The troop deployment data were
initially compiled by Tim Kane using information obtained from the U.S.
Department of Defense’s Defense Manpower Data Center (DMDC). The
original data ended in 2005 and we have updated it to run through
2024.
Similarly, the basing data were initially compiled by David Vine, and we have updated the original data using open source information from the U.S. military and press reports through 2018. We have also assembled this R package to allow users to more easily access the data and use it in their own research.
The package will be updated with additional features in the future, but for now please let me know if you find any errors.
Please refer to the bottom of this page for citation information.
You can also find more information on the package and changes corresponding to each update here: https://meflynn.github.io/troopdata/index.html
You can install the troopdata
package from CRAN or GitHub with:
#install.packages("devtools")
install.packages("troopdata")
or
::install_github("meflynn/troopdata") devtools
This package currently has two functions:
get_troopdata()
: Returns a data frame containing U.S.
military deployment values. Depending on the arguments specified the
function returns total troop deployments, or total deployments plus
service branch-specific deployment values, guard and reserve values, and
DoD civilian values. Users can specify select countries and years, or
call the entire data frame.
get_basedata()
: Returns a data frame containing
information on U.S. military bases around the globe from the Cold War
forward. Depending on the arguments specified the function will return
the entire data set or data for a particular country. Observations can
be site-specific or can be aggregated to generate country counts.
get_builddata()
: Returns a data frame containing
geocoded location-year information on U.S. military overseas
construction spending. Users can specify select countries and years, or
call the entire data frame. Currently only select countries are included
in the data
You can find more detailed vignettes on these functions below:
The original DMDC data contain information on U.S. troop deployments to a a wide range of locations, including several non-state territories and subnational units (e.g. Okinawa). One downside of using the COW country codes as the primary host ID variable is that there are often no country codes for smaller states and territories. In the case of the Vine basing data, some smaller territories have COW codes for the imperial power that controls a territory. For example, Puerto Rico and Guam both receive a COW country code of 2 as they are territorial possessions of the United States. Users may want to distinguish such cases where deployments are present in a territory versus the metropole. Using the ISO country codes provides some additional flexibility when calling the data. Worst case, you can pull the full data frame and look around at the specific observations and figure out what best suits your needs.
When using the updated troop deployment data and/or the
troopdata
package please cite the following:
Kane’s original troop deployment data collected from 1950-2005:
Vine’s original basing data:
Construction data