| Attributes | Values |
|---|---|
| type | |
| seeAlso | |
| http://www.loc.gov...erms/relators/THS | |
| http://eprints.org/ontology/hasDocument | |
| dcterms:issuer | |
| Title | Improving Training of Deep Neural Network Sequence Models |
| described by | |
| Date | |
| Creator | |
| abstract | Sequence models, in particular language models, are fundamental building blocks of downstream applications including speech recognition, speech synthesis, information retrieval, machine translation, and question answering systems. Neural network language models generalise more effectively than traditional N-gram models (i.e. they cope better with the data sparsity problem). However, neural network language models have several fundamental problems: training them is computationally inefficient, and analysing the trained models is difficult. In this thesis, techniques for reducing the computational complexity and an extensive analysis of the learned models are presented.<br><br>To reduce the computational complexity, we have focused on the main computational bottleneck of neural training, which is the softmax operation. Among the various softmax approximation techniques, Noise Contrastive Estimation (NCE) is often seen as a method that does not work well with deep neural models for language modelling. A thorough investigation was conducted to find an appropriate, novel mechanism for integrating NCE with deep neural networks. We have also explained why the proposed hyperparameter settings affect this integration.<br><br>Existing analysis techniques are not sufficient to explain the training process and the learned models, and established learning theory cannot explain the generalisation of over-parametrised deep neural networks. Therefore, we have proposed methods and analysis techniques to understand generalisation and explain regularisation. Furthermore, we have explained the impact of the stacked layers in deep neural networks.<br><br>The presented techniques have made neural language models more accurate and computationally efficient. The empirical analysis techniques have improved our understanding of model learning, generalisation, and regularisation. The experiments were based on publicly available benchmark datasets and standard evaluation frameworks. |
| Is Part Of | |
| list of authors | |
| degree | |
| is topic of | |
| is primary topic of | |
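
The computational point in the abstract is that NCE replaces the full-vocabulary softmax with a binary classification between each observed word and a small number of noise samples, so a training step touches only k + 1 output rows instead of all |V|. The sketch below illustrates that standard idea in PyTorch; it is a minimal illustration under stated assumptions, not the thesis's specific integration mechanism, and the names `NCEOutput`, `nce_loss`, the uniform noise distribution, and all dimensions are assumptions made for the example.

```python
# Minimal sketch of Noise Contrastive Estimation (NCE) as a softmax
# approximation for a neural language model. Illustrative only: class and
# function names, the uniform noise distribution, and all dimensions are
# assumptions, not taken from the thesis.
import torch
import torch.nn.functional as F


class NCEOutput(torch.nn.Module):
    """Output layer scored only on sampled words, avoiding the |V| softmax."""

    def __init__(self, vocab_size, hidden_dim):
        super().__init__()
        self.emb = torch.nn.Embedding(vocab_size, hidden_dim)
        self.bias = torch.nn.Parameter(torch.zeros(vocab_size))

    def score(self, hidden, words):
        # hidden: (batch, d) context vectors; words: (batch, n) word ids.
        w = self.emb(words)                              # (batch, n, d)
        return torch.einsum("bd,bnd->bn", hidden, w) + self.bias[words]


def nce_loss(s_data, s_noise, log_q_data, log_q_noise, k):
    """Binary NCE loss: classify true words against k noise samples.

    The logit for "word is data" is s(w, h) - log(k * q(w)).
    """
    log_k = torch.log(torch.tensor(float(k)))
    logit_data = s_data - (log_k + log_q_data)           # (batch,)
    logit_noise = s_noise - (log_k + log_q_noise)        # (batch, k)
    loss_data = -F.logsigmoid(logit_data)                # true word -> "data"
    loss_noise = -F.logsigmoid(-logit_noise).sum(dim=1)  # samples -> "noise"
    return (loss_data + loss_noise).mean()


# Toy usage: the hidden states would normally come from an RNN/LSTM encoder.
vocab_size, hidden_dim, batch, k = 10_000, 256, 32, 20
out = NCEOutput(vocab_size, hidden_dim)
hidden = torch.randn(batch, hidden_dim)
targets = torch.randint(0, vocab_size, (batch,))

# Uniform noise distribution for illustration; a unigram distribution
# fitted to the training corpus is the usual choice.
noise_dist = torch.full((vocab_size,), 1.0 / vocab_size)
noise = torch.multinomial(noise_dist, batch * k, replacement=True).view(batch, k)

log_q = noise_dist.log()
s_data = out.score(hidden, targets.unsqueeze(1)).squeeze(1)
s_noise = out.score(hidden, noise)
loss = nce_loss(s_data, s_noise, log_q[targets], log_q[noise], k)
loss.backward()
```

Each update therefore evaluates and backpropagates through only k + 1 output rows per position rather than the full vocabulary, which is the source of the efficiency gain the abstract refers to.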