Automatic author profiling of online chat logs

Now that the Internet has become easily accessible and more affordable, a larger number of people spend more time in front of a computer. Some spend so much time on the Internet that they develop friendships and relationships - people with whom they have regular contact via a computer screen and th...

Full description

Bibliographic Details
Main Author: Lin, Jane.
Other Authors: Martell, Craig H.
Published: Monterey, California. Naval Postgraduate School 2012
Online Access:http://hdl.handle.net/10945/3559
id ndltd-nps.edu-oai-calhoun.nps.edu-10945-3559
record_format oai_dc
spelling ndltd-nps.edu-oai-calhoun.nps.edu-10945-35592014-11-27T16:04:43Z Automatic author profiling of online chat logs Lin, Jane. Martell, Craig H. Squire, Kevin M. Naval Postgraduate School (U.S.) Computer Science Now that the Internet has become easily accessible and more affordable, a larger number of people spend more time in front of a computer. Some spend so much time on the Internet that they develop friendships and relationships - people with whom they have regular contact via a computer screen and the Internet. While most of the dialogue exchanged online is not harmful or illegal, ther are those with dishonest intentions lurking online. These people can be breaking the law by seducing a minor virtually or even going as far as meeting a minor in person. Terrorists can also use the Internet to facilitate communication and plan attacks. Since e-mail is one of the original means of communication on the Internet, methods for determining the author of an email have already been studied. So far, however, no significant experimentation with online chat logs exist. The first of part of this study is comprised of generating an unbiased, random, and broad corpus of online chat logs. Having a general corpus with a wide-range of topics allows the results of this research to be applied in the most general case. Because developing a complete solution fto the authorship attribution problem for chat logs is difficult, we limit our scope to predicting gender and age. The ultimate goal of the work, then, is to facilitate the jobs of law enforcers in tracking down criminals who attempt to use the Internet as a hiding place. 2012-03-14T17:38:43Z 2012-03-14T17:38:43Z 2007-03 Thesis http://hdl.handle.net/10945/3559 133185744 Approved for public release, distribution unlimited Monterey, California. Naval Postgraduate School
collection NDLTD
sources NDLTD
description Now that the Internet has become easily accessible and more affordable, a larger number of people spend more time in front of a computer. Some spend so much time on the Internet that they develop friendships and relationships - people with whom they have regular contact via a computer screen and the Internet. While most of the dialogue exchanged online is not harmful or illegal, ther are those with dishonest intentions lurking online. These people can be breaking the law by seducing a minor virtually or even going as far as meeting a minor in person. Terrorists can also use the Internet to facilitate communication and plan attacks. Since e-mail is one of the original means of communication on the Internet, methods for determining the author of an email have already been studied. So far, however, no significant experimentation with online chat logs exist. The first of part of this study is comprised of generating an unbiased, random, and broad corpus of online chat logs. Having a general corpus with a wide-range of topics allows the results of this research to be applied in the most general case. Because developing a complete solution fto the authorship attribution problem for chat logs is difficult, we limit our scope to predicting gender and age. The ultimate goal of the work, then, is to facilitate the jobs of law enforcers in tracking down criminals who attempt to use the Internet as a hiding place.
author2 Martell, Craig H.
author_facet Martell, Craig H.
Lin, Jane.
author Lin, Jane.
spellingShingle Lin, Jane.
Automatic author profiling of online chat logs
author_sort Lin, Jane.
title Automatic author profiling of online chat logs
title_short Automatic author profiling of online chat logs
title_full Automatic author profiling of online chat logs
title_fullStr Automatic author profiling of online chat logs
title_full_unstemmed Automatic author profiling of online chat logs
title_sort automatic author profiling of online chat logs
publisher Monterey, California. Naval Postgraduate School
publishDate 2012
url http://hdl.handle.net/10945/3559
work_keys_str_mv AT linjane automaticauthorprofilingofonlinechatlogs
_version_ 1716720794025852928