Exploration of a Large Database of French Notarial Acts with Social Network Methods
This article illustrates how mathematical and statistical tools designed to handle relational data may be useful to help decipher the most important features and defects of a large historical database and to gain knowledge about a corpus made of several thousand documents. Such a relational model is...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Open Library of Humanities
2014-07-01
|
Series: | Digital Medievalist |
Subjects: | |
Online Access: | https://journal.digitalmedievalist.org/articles/52 |
id |
doaj-c7020dcc6c424c04b6cd5cf10e61aa9c |
---|---|
record_format |
Article |
spelling |
doaj-c7020dcc6c424c04b6cd5cf10e61aa9c2020-11-25T02:28:30ZengOpen Library of HumanitiesDigital Medievalist1715-07362014-07-01910.16995/dm.5252Exploration of a Large Database of French Notarial Acts with Social Network MethodsFabrice Rossi0Nathalie Villa-Vialaneix1Florent Hautefeuille2SAMM, Université Paris 1SAMM, Université Paris 1 & INRA, UR0875 MIA-T, Toulouse,TRACES, Université Toulouse 2This article illustrates how mathematical and statistical tools designed to handle relational data may be useful to help decipher the most important features and defects of a large historical database and to gain knowledge about a corpus made of several thousand documents. Such a relational model is generally enough to address a wide variety of problems, including most databases containing relational tables. In mathematics, it is referred to as a network or a graph. The article's purpose is to emphasise how a relevant relational model of a historical corpus can serve as a theoretical framework which makes available automatic data mining methods designed for graphs. By such methods, for one thing, consistency checking can be performed so as to extract possible transcription errors or interpretation errors during the transcription automatically. Moreover, when the database is so large that a human being is unable to gain much knowledge by even an exhaustive manual exploration, relational data mining can help elucidate the database's main features. First, the macroscopic structure of the relations between entities can be emphasised with the help of network summaries automatically produced by classification methods. A complementary point of view is obtained via local summaries of the relation structure: a set of network-related indicators can be calculated for each entity, singling out, for instance, highly connected entities. Finally, visualisation methods dedicated to graphs can be used to give the user an intuitive understanding of the database. Additional information can be superimposed on such network visualisations, making it possible to intuitively link the relations between entities using attributes that describe each entity. This overall approach is here illustrated with a large corpus of medieval notarial acts, containing several thousand transactions and involving a comparable number of persons.https://journal.digitalmedievalist.org/articles/52network analysistranscription error detectionnotarial actsdata mining in graphsclustering in graphs |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Fabrice Rossi Nathalie Villa-Vialaneix Florent Hautefeuille |
spellingShingle |
Fabrice Rossi Nathalie Villa-Vialaneix Florent Hautefeuille Exploration of a Large Database of French Notarial Acts with Social Network Methods Digital Medievalist network analysis transcription error detection notarial acts data mining in graphs clustering in graphs |
author_facet |
Fabrice Rossi Nathalie Villa-Vialaneix Florent Hautefeuille |
author_sort |
Fabrice Rossi |
title |
Exploration of a Large Database of French Notarial Acts with Social Network Methods |
title_short |
Exploration of a Large Database of French Notarial Acts with Social Network Methods |
title_full |
Exploration of a Large Database of French Notarial Acts with Social Network Methods |
title_fullStr |
Exploration of a Large Database of French Notarial Acts with Social Network Methods |
title_full_unstemmed |
Exploration of a Large Database of French Notarial Acts with Social Network Methods |
title_sort |
exploration of a large database of french notarial acts with social network methods |
publisher |
Open Library of Humanities |
series |
Digital Medievalist |
issn |
1715-0736 |
publishDate |
2014-07-01 |
description |
This article illustrates how mathematical and statistical tools designed to handle relational data may be useful to help decipher the most important features and defects of a large historical database and to gain knowledge about a corpus made of several thousand documents. Such a relational model is generally enough to address a wide variety of problems, including most databases containing relational tables. In mathematics, it is referred to as a network or a graph. The article's purpose is to emphasise how a relevant relational model of a historical corpus can serve as a theoretical framework which makes available automatic data mining methods designed for graphs. By such methods, for one thing, consistency checking can be performed so as to extract possible transcription errors or interpretation errors during the transcription automatically. Moreover, when the database is so large that a human being is unable to gain much knowledge by even an exhaustive manual exploration, relational data mining can help elucidate the database's main features. First, the macroscopic structure of the relations between entities can be emphasised with the help of network summaries automatically produced by classification methods. A complementary point of view is obtained via local summaries of the relation structure: a set of network-related indicators can be calculated for each entity, singling out, for instance, highly connected entities. Finally, visualisation methods dedicated to graphs can be used to give the user an intuitive understanding of the database. Additional information can be superimposed on such network visualisations, making it possible to intuitively link the relations between entities using attributes that describe each entity. This overall approach is here illustrated with a large corpus of medieval notarial acts, containing several thousand transactions and involving a comparable number of persons. |
topic |
network analysis transcription error detection notarial acts data mining in graphs clustering in graphs |
url |
https://journal.digitalmedievalist.org/articles/52 |
work_keys_str_mv |
AT fabricerossi explorationofalargedatabaseoffrenchnotarialactswithsocialnetworkmethods AT nathalievillavialaneix explorationofalargedatabaseoffrenchnotarialactswithsocialnetworkmethods AT florenthautefeuille explorationofalargedatabaseoffrenchnotarialactswithsocialnetworkmethods |
_version_ |
1724837567600263168 |