Automatic author profiling of online chat logs
Now that the Internet has become easily accessible and more affordable, a larger number of people spend more time in front of a computer. Some spend so much time on the Internet that they develop friendships and relationships - people with whom they have regular contact via a computer screen and th...
Main Author: | Lin, Jane. |
---|---|
Other Authors: | Martell, Craig H. |
Published: | Monterey, California. Naval Postgraduate School 2012 |
Online Access: | http://hdl.handle.net/10945/3559 |
id |
ndltd-nps.edu-oai-calhoun.nps.edu-10945-3559 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-nps.edu-oai-calhoun.nps.edu-10945-3559 2014-11-27T16:04:43Z Automatic author profiling of online chat logs Lin, Jane. Martell, Craig H. Squire, Kevin M. Naval Postgraduate School (U.S.) Computer Science Now that the Internet has become easily accessible and more affordable, a larger number of people spend more time in front of a computer. Some spend so much time on the Internet that they develop friendships and relationships - people with whom they have regular contact via a computer screen and the Internet. While most of the dialogue exchanged online is not harmful or illegal, there are those with dishonest intentions lurking online. These people can be breaking the law by seducing a minor virtually or even going as far as meeting a minor in person. Terrorists can also use the Internet to facilitate communication and plan attacks. Since e-mail is one of the original means of communication on the Internet, methods for determining the author of an e-mail have already been studied. So far, however, no significant experimentation with online chat logs exists. The first part of this study consists of generating an unbiased, random, and broad corpus of online chat logs. Having a general corpus with a wide range of topics allows the results of this research to be applied in the most general case. Because developing a complete solution to the authorship attribution problem for chat logs is difficult, we limit our scope to predicting gender and age. The ultimate goal of the work, then, is to facilitate the jobs of law enforcers in tracking down criminals who attempt to use the Internet as a hiding place. 2012-03-14T17:38:43Z 2012-03-14T17:38:43Z 2007-03 Thesis http://hdl.handle.net/10945/3559 133185744 Approved for public release, distribution unlimited Monterey, California. Naval Postgraduate School |
collection |
NDLTD |
sources |
NDLTD |
description |
Now that the Internet has become easily accessible and more affordable, a larger number of people spend more time in front of a computer. Some spend so much time on the Internet that they develop friendships and relationships - people with whom they have regular contact via a computer screen and the Internet. While most of the dialogue exchanged online is not harmful or illegal, there are those with dishonest intentions lurking online. These people can be breaking the law by seducing a minor virtually or even going as far as meeting a minor in person. Terrorists can also use the Internet to facilitate communication and plan attacks. Since e-mail is one of the original means of communication on the Internet, methods for determining the author of an e-mail have already been studied. So far, however, no significant experimentation with online chat logs exists. The first part of this study consists of generating an unbiased, random, and broad corpus of online chat logs. Having a general corpus with a wide range of topics allows the results of this research to be applied in the most general case. Because developing a complete solution to the authorship attribution problem for chat logs is difficult, we limit our scope to predicting gender and age. The ultimate goal of the work, then, is to facilitate the jobs of law enforcers in tracking down criminals who attempt to use the Internet as a hiding place. |
author2 |
Martell, Craig H. |
author_facet |
Martell, Craig H. Lin, Jane. |
author |
Lin, Jane. |
spellingShingle |
Lin, Jane. Automatic author profiling of online chat logs |
author_sort |
Lin, Jane. |
title |
Automatic author profiling of online chat logs |
title_short |
Automatic author profiling of online chat logs |
title_full |
Automatic author profiling of online chat logs |
title_fullStr |
Automatic author profiling of online chat logs |
title_full_unstemmed |
Automatic author profiling of online chat logs |
title_sort |
automatic author profiling of online chat logs |
publisher |
Monterey, California. Naval Postgraduate School |
publishDate |
2012 |
url |
http://hdl.handle.net/10945/3559 |
work_keys_str_mv |
AT linjane automaticauthorprofilingofonlinechatlogs |
_version_ |
1716720794025852928 |