Commit 978ce38e authored by Marie Adler

Update README.md

parent 7a3a8180
@@ -48,7 +48,8 @@ The program uses the GOV web service to retrieve information on individual locat
## Parallelization:
GEDCOM files are processed in parallel to increase speed. You can specify how many CPU cores are used by changing the "Pool()" parameter in the main script. If it is left empty, all available cores are used; this is the script's default.
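As a rough illustration of the "Pool()" parameter, the following sketch uses Python's standard `multiprocessing.Pool`. The names `count_records` and `process_all` are hypothetical stand-ins and are not taken from the project; only the behaviour of `Pool(processes=...)` is standard library behaviour.

```python
from multiprocessing import Pool


def count_records(gedcom_text):
    """Hypothetical stand-in for the per-file work: count level-0 GEDCOM lines."""
    return sum(1 for line in gedcom_text.splitlines() if line.startswith("0 "))


def process_all(gedcom_texts, cores=None):
    """Process several GEDCOM texts in parallel.

    cores=None corresponds to an empty Pool(): Python then starts one
    worker per core reported by the operating system.
    """
    with Pool(processes=cores) as pool:
        return pool.map(count_records, gedcom_texts)
```

Passing `cores=2` would limit processing to two worker processes. On platforms that start workers with the spawn method (Windows, macOS), calls to `process_all` should sit under an `if __name__ == "__main__":` guard.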
## Province assignment:
The location details are assigned to different provinces. By default, provinces before 1871 and after 1990 are assigned in the "provincefinder.py" file. Provincial assignment is not possible for the period in between. However, this can be adapted and extended as needed. The reference time can be changed in the main script, in the "parallel()" function, using the referencetime variable. It is set to the year 1800 by default.
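The effect of the reference time can be sketched as follows. This is only an illustration of the rule described above, not the code in "provincefinder.py": the function name `assignment_period` and its return labels are hypothetical, and treating 1871 and 1990 themselves as part of the uncovered period is an assumption.

```python
def assignment_period(referencetime):
    """Return which default province set would apply to a reference year.

    Hypothetical helper. Assumes the boundary years 1871 and 1990
    belong to the period without province assignment.
    """
    if referencetime < 1871:
        return "before 1871"
    if referencetime > 1990:
        return "after 1990"
    return None  # no default province assignment for this period


referencetime = 1800  # default value named in the README
print(assignment_period(referencetime))
```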
## Cluster:
The clustering of locations plays an important role in their identification. The minimum distance and the minimum number of locations in a cluster can be varied. The minimum distance between two clusters can be changed in the "qualityChecker()" function of the "qualitychecker.py" file via the condition "if distance <= 50:". The same function contains the variable minimumClusterSize, which sets the minimum size of a cluster. It is set to 6 locations by default.
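To show how the two settings might interact, here is a minimal sketch. It is not the actual "qualityChecker()" implementation: the helper names are hypothetical, the way the two checks are combined is an assumption, and the unit of the distance threshold is not stated in the README.

```python
DISTANCE_THRESHOLD = 50   # mirrors "if distance <= 50:"; unit not stated in the README
minimumClusterSize = 6    # default named in the README


def close_locations(distances, threshold=DISTANCE_THRESHOLD):
    """Keep only the distances that pass the threshold check."""
    return [d for d in distances if d <= threshold]


def is_cluster(distances, minimum_size=minimumClusterSize,
               threshold=DISTANCE_THRESHOLD):
    """Assumed rule: enough locations must lie within the threshold."""
    return len(close_locations(distances, threshold)) >= minimum_size
```

Raising the threshold or lowering the minimum size makes it easier for a group of locations to count as a cluster under this assumed rule.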
@@ -60,6 +61,6 @@ The file "provincesdict.csv" contains four columns: The unchanged location name
For each place name, the "placefinder.csv" file contains the ID (GOV-ID), the coordinates, information on how the assignment to the GOV-ID took place, the corrected version of the place name, the original place name, and the name of the file in which the name appears.
Jan Michael Goldberg, June 30, 2022 *(translated by Marie Adler, January 09, 2024)*