manage geo_sirene
Management of the geo_sirene database
geo_sirene is a database that describes economic activities in France. It is maintained by data.cquest. It is derived from the sirene data maintained by INSEE: a post-processing step adds geolocation information computed from the addresses given in the sirene database.
There are two versions of the sirene database, named 2017 and 2019 in https://data.cquest.org/geo_siren :
- monthly vintages since 2017-12 for version 2017
- monthly vintages since 2018-10 for version 2019
The original data can be downloaded from https://files.data.gouv.fr/insee-sirene (monthly vintages since 2018-10, so they should be in version 2019).
NEW: https://files.data.gouv.fr/geo-sirene/ also needs to be looked at.
Functions/datasets to be added
- download_geo_sirene - description: download the data from data.cquest, where it is produced.
  - adaptation of download.geo_sirene
  - use the new journal
  - output: archive in original version
- adapt_geo_sirene - description: format the data from the official repository into the target format based on scheme_sirene_2019.
  - adaptation of adapt.geo_sirene
  - use the new journal
  - output: archive in formatted version
- scheme_sirene_2019 and scheme_sirene_na - description: global data to be used for formatting the data.
  - verification of what has already been done, possibly adaptation of scheme_sirene_na
  - see if there is a need for scheme_sirene_2017
- analyse_geo_sirene - description: some analyses to be performed after downloading and formatting.
  - entirely to be defined
- vignettes/geo_sirene - description: presentation of the data
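As a starting point for the download_geo_sirene and adapt_geo_sirene items above, here is a minimal R sketch of how the two steps might fit together. Only the two function names come from this list; everything else is an assumption made for illustration: the arguments (file_name, base_url, dest_dir, journal, dry_run), the helper write_journal, the tab-separated journal format, the representation of a scheme as a data frame with name and type columns, and the column names in the example data. The actual journal mechanism and the structure of scheme_sirene_2019 are not described here, so they may well differ.

```r
# Hypothetical sketch: argument names, journal format and scheme layout
# are assumptions, not the package's actual design.

# Append one tab-separated line to a plain-text journal (assumed format).
write_journal <- function(journal, step, status, detail) {
  entry <- paste(format(Sys.time(), "%Y-%m-%d %H:%M:%S"),
                 step, status, detail, sep = "\t")
  cat(entry, "\n", sep = "", file = journal, append = TRUE)
}

# Download one archive and keep it unchanged ("original version").
# The remote file name is an argument because the naming is not fixed here.
download_geo_sirene <- function(file_name, base_url, dest_dir,
                                journal = file.path(dest_dir, "journal.log"),
                                dry_run = FALSE) {
  dir.create(dest_dir, recursive = TRUE, showWarnings = FALSE)
  url <- paste(sub("/+$", "", base_url), file_name, sep = "/")
  dest <- file.path(dest_dir, basename(file_name))
  status <- "dry-run"
  if (!dry_run) {
    status <- tryCatch({
      # download.file() returns 0 on success
      code <- utils::download.file(url, destfile = dest,
                                   mode = "wb", quiet = TRUE)
      if (code == 0L) "ok" else "failed"
    }, error = function(e) "failed")
  }
  write_journal(journal, "download", status, url)
  invisible(list(url = url, dest = dest, status = status))
}

# Keep only the columns listed in the scheme and coerce each one
# to the type the scheme gives for it.
adapt_geo_sirene <- function(raw, scheme, journal) {
  missing_cols <- setdiff(scheme$name, names(raw))
  if (length(missing_cols) > 0L) {
    stop("columns missing from raw data: ",
         paste(missing_cols, collapse = ", "))
  }
  out <- raw[, scheme$name, drop = FALSE]
  for (i in seq_len(nrow(scheme))) {
    convert <- switch(scheme$type[i],
                      character = as.character,
                      numeric = as.numeric,
                      stop("unsupported type: ", scheme$type[i]))
    out[[scheme$name[i]]] <- convert(out[[scheme$name[i]]])
  }
  write_journal(journal, "adapt", "ok", paste(nrow(out), "rows"))
  out
}

# Usage with made-up data; dry_run = TRUE avoids any network access.
work_dir <- tempfile("geo_sirene_")
res <- download_geo_sirene("example.csv.gz", "https://example.org/data",
                           work_dir, dry_run = TRUE)
scheme <- data.frame(name = c("siret", "longitude", "latitude"),
                     type = c("character", "numeric", "numeric"),
                     stringsAsFactors = FALSE)
raw <- data.frame(siret = c("12345678900011", "98765432100022"),
                  longitude = c("2.35", "5.37"),
                  latitude = c("48.85", "43.30"),
                  extra = c("a", "b"),
                  stringsAsFactors = FALSE)
formatted <- adapt_geo_sirene(raw, scheme,
                              file.path(work_dir, "journal.log"))
str(formatted)
```

Passing the scheme as an argument keeps the formatting step independent of the sirene version, which may help when deciding whether a separate scheme_sirene_2017 is needed.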
For each function/dataset, please follow this procedure:
- creation of a function
  - code of the function
  - documentation of the function
  - example for the function
  - test for the function
- creation of a dataset
  - script for the creation of the "raw" data (if needed)
  - storage of the raw data in ASCII format, normally in inst/extdata
  - script to maintain the data in /data-raw
  - documentation of the data
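The procedure above could look roughly like the following R sketch. It is only an illustration: the helper vintage_label, the dataset scheme_example and its content are hypothetical, the documentation uses roxygen2-style comments on the assumption that the package is documented that way, and a temporary directory stands in for the package root so that the snippet can be run on its own.

```r
# --- Function: code, documentation and example (roxygen2-style comments) ---

#' Build the label of a monthly vintage
#'
#' Hypothetical helper, used only to illustrate the procedure.
#'
#' @param year Year, e.g. 2018.
#' @param month Month number between 1 and 12.
#' @return A character string of the form "YYYY-MM".
#' @examples
#' vintage_label(2018, 10)
#' @export
vintage_label <- function(year, month) {
  stopifnot(month >= 1, month <= 12)
  sprintf("%04d-%02d", as.integer(year), as.integer(month))
}

# --- Function: test, as it might appear under tests/testthat/ ---
# Guarded so the snippet still runs when testthat is not installed.
if (requireNamespace("testthat", quietly = TRUE)) {
  testthat::test_that("vintage_label pads the month", {
    testthat::expect_equal(vintage_label(2017, 2), "2017-02")
    testthat::expect_error(vintage_label(2017, 13))
  })
}

# --- Dataset: raw ASCII file in inst/extdata, script kept in data-raw ---
pkg_root <- tempfile("pkg_")  # stands in for the package directory
dir.create(file.path(pkg_root, "inst", "extdata"), recursive = TRUE)
dir.create(file.path(pkg_root, "data"))

# Raw data stored as plain text (made-up content).
raw_file <- file.path(pkg_root, "inst", "extdata", "scheme_example.csv")
writeLines(c("name,type", "siret,character", "longitude,numeric"), raw_file)

# What a script in data-raw could do: read the raw file, save the dataset.
# Inside a real package, usethis::use_data(scheme_example) is a common way
# to perform the saving step.
scheme_example <- read.csv(raw_file, stringsAsFactors = FALSE)
save(scheme_example,
     file = file.path(pkg_root, "data", "scheme_example.rda"))
```

The dataset documentation would then typically be a roxygen2 block placed above the quoted dataset name in a file under R/, for example R/data.R.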