Fusion de doublons
Cet outil de Fusion des Doublons détecte les entités dupliquéesdoublons et vous permet de les fusionner s'il s'agit de vrais doublons, ou de les confirmer comme non dupliquésdoublons sisinon.
Il existe trois façons de rechercher et de fusionner les doublons. Elles sont expliquées dans cette page.
GlobalRecherchesearchglobaleacrossàthetraverswholetoutegenealogylafilegenealogieAutomaticDétectiondetectionautomatiqueeachàtimechaqueanfoisentityqu'uneisentitémodifiedest modifiéeManualActionmergemanuelleactionde fusionfromparthel'utilisateuruserpourtoforcerforcelathefusionmergedeofdeuxtwoentitésselected entitiessélectionnées
TheLa followingfenêtre windowsuivante isest usedutilisée indans allces 3trois situations. ThisCette page describesdécrit theles componentscomposants ofde thel'outil toolet andcomment how to use it.
l'utiliser.
Description
TheL'outil Mergede DuplicatesFusion tooldes isDoublons madeest ofcomposé de 3 components.éléments.
DetectionFenêtrecriteriadeswindowCritères de Détection:thisc'estisiciwherequeyouvouscanpouvezsetdéfinirsomecertainsofdesthecritèrescriteriaquiwhichdéterminerontwillsideterminedeuxifentitéstwodoivententitiesêtreshouldconsidéréesbecommeconsidereddesduplicatesdoublonsorounot.non.MergeFenêtrewindowde Fusion:thisc'estisiciwherequeyouvouswillverrezseetousalllespotentialdoublonsduplicatepotentielsentitiesetandquedecidevouswheredéciderezordenotlestofusionnermergeouthem.non.SpecialNotenotespécialefor alldes nonduplicatesdoublons:thisc'estisiciwherequ'Ancestrisstoresstockealltoutesyourvos confirmationsthatquetwodeuxentitiesentitésarenenotsontduplicates.pas des doublons.
DetectionFenêtre criteriades window
Critères de Détection
ProbabilityIndicateur indicatorde probabilité
ItIl isest difficultdifficile tod'évaluer assessavec withune 100%certitude certaintyde that100 two% entitiesque aredeux duplicatesentités orsont aredes notdoublons duplicates.ou Evenne asont humanpas beingdes candoublons. sometimesMême haveun difficultyêtre certifyinghumain thatpeut twoparfois individualsavoir ordes entitiesdifficultés areà thecertifier certainlyque thedeux sameindividus orou certainlyentités notsont thecertainement same.les mêmes ou certainement pas les mêmes.
OfBien course,sûr, itil wouldserait beplus easierfacile tode limitlimiter thela detectiondétection toen sayingdisant thatque twodeux individualspersonnes withayant exactlyexactement thele samemême surname,nom, firstle namemême andprénom et la même date ofde birthnaissance aresont duplicates.des Indoublons. reality,En réalité, les dates couldpeuvent beêtre eithermanquantes missingou orapproximatives, beles approximate,prénoms firstpeuvent namesêtre coulddans beun inordre differentdifférent orderou or incomplete,incomplets, etc. InDans theseces cases,cas, youvous wouldvoudriez stillquand wantmême qu'Ancestris topuisse bedétecter ablequelque to detect something.chose.
ThereforeC'est pourquoi Ancestris usesutilise aun probabilityindicateur indicatorde probabilité. ThePlus morecertaines someinformations informationsont issimilaires, similar,plus theil moreest probable theque entitiesles areentités duplicates.soient des doublons.
Ancestris thendresse listsensuite thela potentialliste duplicatesdes accordingdoublons topotentiels thisen indicatorfonction inde ancet intentindicateur todans telll'intention you:de vous dire : "WhileBien thisque iscela notne soit pas certain, givenétant thedonné similaritiesles insimilitudes thedans informationles betweeninformations theseentre twoces individuals,deux theypersonnes, mightil beest duplicates.possible Andqu'elles thissoient isdes thedoublons. levelEt ofvoici confidencele thatniveau theyde areconfiance qu'il y a entre eux". ThenEnsuite, it'sc'est upà tovous youde todécider decidede offusionner whetherou mergingd'écarter orla discarding the resemblance.ressemblance.
WhatCela thissignifie meanqu'Ancestris ispeut thatvous montrer des doublons potentiels que vous considérerez comme non doublons, et inversement, Ancestris mightpeut showne youpas potentialtrouver duplicatescertains thatdoublons youqui willpourraient consideren asfait nonêtre duplicates,de andvrais reversely,doublons selon vous.
Veuillez accepter nos excuses si le détecteur n'est pas parfait et veuillez nous faire savoir si vous trouvez de tels cas.
Pour calculer cet indicateur de probabilité, Ancestris mightutilise misscertains to show you some duplicates that could actually be real duplicates according to you. Please accept our apologies if the detector is not perfect and please let us know if you find such instances.critères.
ToDans calculatela thissection probablysuivante, indicator,vous Ancestrisavez usesla somepossibilité criteria.de modifier certains d'entre eux.
In the next section you are given the possibility to change some of them.
CriteriaSélecteur selectorde critères
WhenLorsqu'une arecherche globalglobale searchde fordoublons duplicatesest islancée, launched,la thefenêtre detectiondes criteriacritères windowde isdétection displayeds'affiche withavec aun numbercertain ofnombre criteria.de critères.
CheckCochez theles entitycases boxesdes forentités whichpour youlesquelles wantvous tosouhaitez searchrechercher fordes duplicates.doublons.
OnlySeules theles boxescases ofdes entitiesentités thatprésentes aredans presentle in thefichier Gedcom filesont areactivées. enabled.Dans Inl'exemple theci-dessus, examplecomme above,il asn'y therea arepas nod'entités mediamédias, entities,le thebouton correspondingCritères Criteriacorrespondant buttonn'est ispas unavailable.disponible.
ThenVous youpouvez canalors presscliquer thesur le bouton "Criteria.Critères..." buttonpour tospécifier specifycertains somecritères ofpour thechaque criteriacatégorie for each category of entity.d'entité.
Si Invous souhaitez exclure de la recherche les doublons connus, c'est-à-dire ceux que vous avez déjà confirmés comme n'étant pas des doublons, cochez la case youcorrespondante.
Pour tovoir excludela knownliste des paires d'entités que vous avez confirmées comme n'étant pas des doublons, cliquez sur le bouton "Liste...".
Si vous appuyez sur le bouton "Liste...", la liste des entités non duplicatesdoublons from the search, i.e. those that you have already confirmed as non duplicates, check the corresponding box.s'affiche.
To see the list of pairs of entities that you have confirmed as non duplicates, press the "Show List" button.
If you press the Show List button, the list of Non Duplicates appear.
EntityCritères criteriad'Entité
TheLes mostcritères sophisticatedles criteriaplus aresophistiqués thosesont ofceux individuals.des Hereindividus. theyLes are.voici.
TheLes criteriacritères arepour asles follows.individus sont les suivants.
Les critères pour les autres entités que les individus sont soit une sous-partie de ces critères, soit ne sont pas modifiables.
IdenticalDates datesidentiques
WhenQuand are twodeux dates consideredsont-elles identical?considérées Whencomme theiridentiques difference? inLorsque numberleur ofdifférence daysen isnombre closede orjours zero.est proche ou nulle.
IfSi youvous indicateindiquez par exemple 365 daysjours, for example, i.e.c'est-à-dire 1 year,an, twodeux dates willseront beégales equalsi ifleur theirdifférence differenceest isinférieure lessà thanun a year.an.
IfSi youvous indicateindiquez 30 days,jours, twodeux dates willseront beégales equalsi ifleur theydifférence differest byinférieure lessà thanun a month.mois.
EmptyDate orvides invalidou datesinvalides
IfSi a knownune date isconnue comparedest tocomparée anà unknownune date,date inconnue, Ancestris willles considerconsidérera themcomme different.différentes.
NameÉléments elementsdu nom
ForcesOblige alltous elementsles oféléments thedu namenom toà beêtre identical.identiques. Conversely,Inversement, canpeut beêtre identicalidentique ifsi onlyseuls somecertains elementséléments ofdu thenom namesont are identical.identiques.
First namesPrénoms
ForcesOblige alltous firstles namesprénoms toà beêtre identical.identiques. Conversely,Inversement, canpeut beêtre identicalidentique ifsi onlyseulement somecertains firstprénoms namessont are identical.identiques.
Exclusion ofdes individualsindividus fromde thela samemême familyfamille
IndividualsLes fromindividus thed'une samemême siblingfratrie orou d'une même relation parent-childenfant relationshipne aresont notpas compared.comparés.
Exclusion ofdes individualspersonnes withoutsans firstnom orou last nameprénom
IndividualsLes withoutpersonnes firstqui orn'ont lastpas namesde arenom notou compared.de prénom ne sont pas comparées.
The criteria for other entities are either a sub-part of these criteria or are not modifiable.
Merge window
For entities compared as duplicates, the following window is used.
Window
This window displays one by one the total list of all potential duplicates where the probability is greater than 50%.
The list is sorted from the most certain pair of duplicates to the least certain pair of duplicates, by category of entity. For each pair of similar entities, Ancestris gives you a probability percentage.
The title of the window indicates the duplicate pair number displayed and the confidence that the two entities of this pair are in fact the same, and therefore to be merged.
A general message and displayed and depends on the situation the tool was launched, as global search, automatic detection, or manual action.
Each pair of duplicates made of two entities is displayed in the two columns.
In each column are displayed the properties of each entities of the supposed duplicate.
Values that are different are displayed in red.
Values that are identical are displayed in blue for the left hand side entity, in grey for the right hand side entity.
The purpose of the comparison is to merge the right entity into the left one if you confirm them as duplicate
Therefore, a check boxe is available for each property on the right hand side to tell Ancestris to keep both selected information of each entity after merging them.
Toolbar
Go to first duplicate Button
Displays the first duplicate of the list in the order of the confidence index, i.e. the most likely duplicate, or the duplicate which is 50 positions before the current one in case there are more than 50 duplicates in the list.
Go to previous duplicate Button
Displays the previous duplicate.
Swap Left and Right Entities Button
Swap the left and right entities in order to merge the two entities on the left one. This is useful if most of the information to be kept after the merge is on the right hand side.
Go to next duplicate Button
Displays the next duplicate.
Go to last duplicate Button
Displays the last duplicate of the list in the confidence index, therefore the least likely duplicate, or the duplicate which is 50 positions after the current one in case there are more than 50 duplicates in the list.
Remove duplicate Button
Removes the potential duplicate from the displayed list.
It is useful if you do not know yet whether the two entities are duplicates or not, and you want to postpone the decision.
If a new global duplicate search is started, the duplicate will reappear.
Merge Button
By clicking the Merge button, the entities will be merged.
The entity on the right is removed from the Gedcom file and the information which check box is checked on the right hand side is added to the entity on the left.
For information that can only exist once (e.g. birth), it is only possible to keep the information from one of the two entities.
As soon as the merge is done, the window displays the same duplicate with the result of the merge so that you can check that everything has been kept as you wanted.
You can then move on to the next duplicate.
Non duplicate Button
This button excludes the pair of entities from the potential duplicates.
It marks the entity pair to be non duplicate and stores this confirmation in the special "non duplicates" note.
Close Button
Closes the window.
Special note for all non duplicates
A special note is created and updated in Ancestris to store the non duplicate confirmations.
This note stores user confirmations of similar pairs that are actually not duplicates according to you.
It avoids Ancestris detecting them again and again each time the global search or the automatic detection is run.
This note has a reference name called "Non_Duplicates".
The note is updated each time you press the Non Duplicate button in the Merge window or update the list using the Criteria window.
We have chosen to store this user information in the Gedcom file itself because we wanted your efforts to analyse the entities and decide that they are not duplicates as a valuable piece of genealogy information that has to be kept and transferred as part of the Gedcom file.
The Gedcom standard does not cater for this need, hence Ancestris choice to store it this way, in one single note.
Should the Gedcom standard evolve, such as a NOALIAS tag, we might change the way this information would be managed.
You can see the list of confirmed non duplicates from the criteria window.
List of non duplicates
You can see the list of non duplicates by pressing the "Show list" button on the Detection Criteria window.
You can sort the lines to find the entities you are interested in.
You can select one or several lines and remove them from the list if you need to.
Usage
As mentioned above, there are 3 ways to use this tool.
- Global search across the whole genealogy file
- Automatic detection each time an entity is modified
- Manual merge action from the user to force the merge of two selected entities
Global search
The purpose of the global search is both to identify duplicates throughout the whole genealogy and act on them, that is decide one by one what you want to do with each them.
Your decision for each duplicate will then be to either
- merge the duplicate,
- declare it as a non duplicate,
- or postpone the decision to later.
You can launch the global search from the Ancestris tools menu.
The duplicate merge tool works in two steps.
- First you specify the detection criteria in the corresponding window,
- then you choose how to merge duplicates in the Merge window.
While using the tool, the genealogy is changed accordingly
- Entities you decide to merge are merged with the information you specified to keep
- Entities you declare as non duplicates are logged into the special note.
A message lets you know when you close the merge window.
Automatic detection
The purpose of the Automatic Detection is to alert you in case the entity you are currently creating or modifying is a potential duplicate of another entity already existing in your genealogy.
The automatic detection of duplicates is activated by default in the Ancestris preferences.
As soon as you validate your entry in one of the editors, and if the corresponding preference box is checked, the detection automatically searches potential duplicates of the entity being edited.
All potential duplicates are then presented in the Merge window, for you to decide what you want to do with these duplicates.
In the case of the Cygnus editor where several entities can be edited at the same time, all modified entities are checked and therefore Ancestris will list in the Merge window all the potential duplicates of all the modified entities.
Manual Merge action
The purpose of the Manual Merge action is to merge two entities, regardless of whether Ancestris detected them or not.
Another purpose is to identify any duplicate for a given entity.
This action is accessible from the Context menu on the current entity you want to merge with another one.
When this action menu item is selected, Ancestris asks you which other entity the current entity is to be merged with.
- To really merge with another entity, pick the one you think is a duplicate.
- To just know whether the current entity has got duplicates in the genealogy, just choose any entity from the list of entities.
Then Ancestris displays the Merge window with a list of potential duplicates among which will be the pair of two entities you chose and its corresponding probability of being the same entity, which can be 0%.
The list will be sorted in decreasing probability and the shown duplicate pair will be your chosen pair of entities.
If the current entity you started from has no other found duplicates, only the pair of chosen entities will be shown in the list.
Then you may decide to merge the current entity you started from with the other chosen entity, or any other entity from the list.
Customization
There are 3 customization settings for the Merge tool.
- The search entity criteria, stored in the User Directory.
- The automatic detection flag, stored in the Preferences in the User Directory.
- The non duplicate pairs, stored in a special note.