Exploration of a Large Database of French Notarial Acts with Social Network Methods

This article illustrates how mathematical and statistical tools designed to handle relational data may be useful to help decipher the most important features and defects of a large historical database and to gain knowledge about a corpus made of several thousand documents. Such a relational model is...

Full description

Bibliographic Details
Main Authors: Fabrice Rossi, Nathalie Villa-Vialaneix, Florent Hautefeuille
Format: Article
Language:English
Published: Open Library of Humanities 2014-07-01
Series:Digital Medievalist
Subjects:
Online Access:https://journal.digitalmedievalist.org/articles/52
id doaj-c7020dcc6c424c04b6cd5cf10e61aa9c
record_format Article
spelling doaj-c7020dcc6c424c04b6cd5cf10e61aa9c2020-11-25T02:28:30ZengOpen Library of HumanitiesDigital Medievalist1715-07362014-07-01910.16995/dm.5252Exploration of a Large Database of French Notarial Acts with Social Network MethodsFabrice Rossi0Nathalie Villa-Vialaneix1Florent Hautefeuille2SAMM, Université Paris 1SAMM, Université Paris 1 & INRA, UR0875 MIA-T, Toulouse,TRACES, Université Toulouse 2This article illustrates how mathematical and statistical tools designed to handle relational data may be useful to help decipher the most important features and defects of a large historical database and to gain knowledge about a corpus made of several thousand documents. Such a relational model is generally enough to address a wide variety of problems, including most databases containing relational tables. In mathematics, it is referred to as a network or a graph. The article's purpose is to emphasise how a relevant relational model of a historical corpus can serve as a theoretical framework which makes available automatic data mining methods designed for graphs. By such methods, for one thing, consistency checking can be performed so as to extract possible transcription errors or interpretation errors during the transcription automatically. Moreover, when the database is so large that a human being is unable to gain much knowledge by even an exhaustive manual exploration, relational data mining can help elucidate the database's main features. First, the macroscopic structure of the relations between entities can be emphasised with the help of network summaries automatically produced by classification methods. A complementary point of view is obtained via local summaries of the relation structure: a set of network-related indicators can be calculated for each entity, singling out, for instance, highly connected entities. Finally, visualisation methods dedicated to graphs can be used to give the user an intuitive understanding of the database. Additional information can be superimposed on such network visualisations, making it possible to intuitively link the relations between entities using attributes that describe each entity. This overall approach is here illustrated with a large corpus of medieval notarial acts, containing several thousand transactions and involving a comparable number of persons.https://journal.digitalmedievalist.org/articles/52network analysistranscription error detectionnotarial actsdata mining in graphsclustering in graphs
collection DOAJ
language English
format Article
sources DOAJ
author Fabrice Rossi
Nathalie Villa-Vialaneix
Florent Hautefeuille
spellingShingle Fabrice Rossi
Nathalie Villa-Vialaneix
Florent Hautefeuille
Exploration of a Large Database of French Notarial Acts with Social Network Methods
Digital Medievalist
network analysis
transcription error detection
notarial acts
data mining in graphs
clustering in graphs
author_facet Fabrice Rossi
Nathalie Villa-Vialaneix
Florent Hautefeuille
author_sort Fabrice Rossi
title Exploration of a Large Database of French Notarial Acts with Social Network Methods
title_short Exploration of a Large Database of French Notarial Acts with Social Network Methods
title_full Exploration of a Large Database of French Notarial Acts with Social Network Methods
title_fullStr Exploration of a Large Database of French Notarial Acts with Social Network Methods
title_full_unstemmed Exploration of a Large Database of French Notarial Acts with Social Network Methods
title_sort exploration of a large database of french notarial acts with social network methods
publisher Open Library of Humanities
series Digital Medievalist
issn 1715-0736
publishDate 2014-07-01
description This article illustrates how mathematical and statistical tools designed to handle relational data may be useful to help decipher the most important features and defects of a large historical database and to gain knowledge about a corpus made of several thousand documents. Such a relational model is generally enough to address a wide variety of problems, including most databases containing relational tables. In mathematics, it is referred to as a network or a graph. The article's purpose is to emphasise how a relevant relational model of a historical corpus can serve as a theoretical framework which makes available automatic data mining methods designed for graphs. By such methods, for one thing, consistency checking can be performed so as to extract possible transcription errors or interpretation errors during the transcription automatically. Moreover, when the database is so large that a human being is unable to gain much knowledge by even an exhaustive manual exploration, relational data mining can help elucidate the database's main features. First, the macroscopic structure of the relations between entities can be emphasised with the help of network summaries automatically produced by classification methods. A complementary point of view is obtained via local summaries of the relation structure: a set of network-related indicators can be calculated for each entity, singling out, for instance, highly connected entities. Finally, visualisation methods dedicated to graphs can be used to give the user an intuitive understanding of the database. Additional information can be superimposed on such network visualisations, making it possible to intuitively link the relations between entities using attributes that describe each entity. This overall approach is here illustrated with a large corpus of medieval notarial acts, containing several thousand transactions and involving a comparable number of persons.
topic network analysis
transcription error detection
notarial acts
data mining in graphs
clustering in graphs
url https://journal.digitalmedievalist.org/articles/52
work_keys_str_mv AT fabricerossi explorationofalargedatabaseoffrenchnotarialactswithsocialnetworkmethods
AT nathalievillavialaneix explorationofalargedatabaseoffrenchnotarialactswithsocialnetworkmethods
AT florenthautefeuille explorationofalargedatabaseoffrenchnotarialactswithsocialnetworkmethods
_version_ 1724837567600263168