A study of semantics across different representations of language

Semantics is the study of meaning and here we explore it through three major representations: brain, image and text. Researchers in the past have performed various studies to understand the similarities between semantic features across all the three representations. Distributional Semantic (DS) mo...

Full description

Bibliographic Details
Main Author: Dharmaretnam, Dhanush
Other Authors: Fyshe, Alona
Format: Others
Language:English
en
Published: 2018
Subjects:
Online Access:https://dspace.library.uvic.ca//handle/1828/9399
id ndltd-uvic.ca-oai-dspace.library.uvic.ca-1828-9399
record_format oai_dc
spelling ndltd-uvic.ca-oai-dspace.library.uvic.ca-1828-93992018-05-29T17:31:57Z A study of semantics across different representations of language Dharmaretnam, Dhanush Fyshe, Alona Computational linguistics Semantics Semantics in Brain Convolutional Neural Networks Deep learning Semantics is the study of meaning and here we explore it through three major representations: brain, image and text. Researchers in the past have performed various studies to understand the similarities between semantic features across all the three representations. Distributional Semantic (DS) models or word vectors that are trained on text corpora have been widely used to study the convergence of semantic information in the human brain. Moreover, they have been incorporated into various NLP applications such as document categorization, speech to text and machine translation. Due to their widespread adoption by researchers and industry alike, it becomes imperative to test and evaluate the performance of di erent word vectors models. In this thesis, we publish the second iteration of BrainBench: a system designed to evaluate and benchmark word vectors using brain data by incorporating two new Italian brain datasets collected using fMRI and EEG technology. In the second half of the thesis, we explore semantics in Convolutional Neural Network (CNN). CNN is a computational model that is the state of the art technology for object recognition from images. However, these networks are currently considered a black-box and there is an apparent lack of understanding on why various CNN architectures perform better than the other. In this thesis, we also propose a novel method to understand CNNs by studying the semantic representation through its hierarchical layers. The convergence of semantic information in these networks is studied with the help of DS models following similar methodologies used to study semantics in the human brain. Our results provide substantial evidence that Convolutional Neural Networks do learn semantics from the images, and the features learned by the CNNs correlate to the semantics of the object in the image. Our methodology and results could potentially pave the way for improved design and debugging of CNNs. Graduate 2018-05-28T20:59:24Z 2018-05-28T20:59:24Z 2018 2018-05-28 Thesis https://dspace.library.uvic.ca//handle/1828/9399 English en Available to the World Wide Web application/pdf
collection NDLTD
language English
en
format Others
sources NDLTD
topic Computational linguistics
Semantics
Semantics in Brain
Convolutional Neural Networks
Deep learning
spellingShingle Computational linguistics
Semantics
Semantics in Brain
Convolutional Neural Networks
Deep learning
Dharmaretnam, Dhanush
A study of semantics across different representations of language
description Semantics is the study of meaning and here we explore it through three major representations: brain, image and text. Researchers in the past have performed various studies to understand the similarities between semantic features across all the three representations. Distributional Semantic (DS) models or word vectors that are trained on text corpora have been widely used to study the convergence of semantic information in the human brain. Moreover, they have been incorporated into various NLP applications such as document categorization, speech to text and machine translation. Due to their widespread adoption by researchers and industry alike, it becomes imperative to test and evaluate the performance of di erent word vectors models. In this thesis, we publish the second iteration of BrainBench: a system designed to evaluate and benchmark word vectors using brain data by incorporating two new Italian brain datasets collected using fMRI and EEG technology. In the second half of the thesis, we explore semantics in Convolutional Neural Network (CNN). CNN is a computational model that is the state of the art technology for object recognition from images. However, these networks are currently considered a black-box and there is an apparent lack of understanding on why various CNN architectures perform better than the other. In this thesis, we also propose a novel method to understand CNNs by studying the semantic representation through its hierarchical layers. The convergence of semantic information in these networks is studied with the help of DS models following similar methodologies used to study semantics in the human brain. Our results provide substantial evidence that Convolutional Neural Networks do learn semantics from the images, and the features learned by the CNNs correlate to the semantics of the object in the image. Our methodology and results could potentially pave the way for improved design and debugging of CNNs. === Graduate
author2 Fyshe, Alona
author_facet Fyshe, Alona
Dharmaretnam, Dhanush
author Dharmaretnam, Dhanush
author_sort Dharmaretnam, Dhanush
title A study of semantics across different representations of language
title_short A study of semantics across different representations of language
title_full A study of semantics across different representations of language
title_fullStr A study of semantics across different representations of language
title_full_unstemmed A study of semantics across different representations of language
title_sort study of semantics across different representations of language
publishDate 2018
url https://dspace.library.uvic.ca//handle/1828/9399
work_keys_str_mv AT dharmaretnamdhanush astudyofsemanticsacrossdifferentrepresentationsoflanguage
AT dharmaretnamdhanush studyofsemanticsacrossdifferentrepresentationsoflanguage
_version_ 1718682429792714752