This HTML5 document contains 23 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

PrefixIRI
dctermshttp://purl.org/dc/terms/
n2https://kar.kent.ac.uk/id/eprint/
wdrshttp://www.w3.org/2007/05/powder-s#
n16http://purl.org/ontology/bibo/status/
rdfshttp://www.w3.org/2000/01/rdf-schema#
n15https://kar.kent.ac.uk/id/subject/
n12doi:10.1109/
n9https://demo.openlinksw.com/about/id/entity/https/raw.githubusercontent.com/annajordanous/CO644Files/main/
n6http://eprints.org/ontology/
bibohttp://purl.org/ontology/bibo/
n5https://kar.kent.ac.uk/id/publication/
n13https://kar.kent.ac.uk/id/org/
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
owlhttp://www.w3.org/2002/07/owl#
n14https://kar.kent.ac.uk/id/eprint/90112#
n7https://kar.kent.ac.uk/id/
n19https://kar.kent.ac.uk/90112/
xsdhhttp://www.w3.org/2001/XMLSchema#
n17https://demo.openlinksw.com/about/id/entity/https/www.cs.kent.ac.uk/people/staff/akj22/materials/CO644/
n4https://kar.kent.ac.uk/id/person/

Statements

Subject Item
n2:90112
rdf:type
n6:EPrint bibo:AcademicArticle bibo:Article n6:ArticleEPrint
rdfs:seeAlso
n19:
owl:sameAs
n12:TCYB.2017.2753880
dcterms:title
A Generalized Methodology for Data Analysis
wdrs:describedby
n9:export_kar_RDFN3.n3 n17:export_kar_RDFN3.n3
dcterms:date
2018-10-01
dcterms:creator
n4:ext-f51891477848e4c7f127c7434b2f419a n4:ext-cd132263f24aac658abad18c9a801ab6 n4:ext-x.gu@kent.ac.uk
bibo:status
n16:peerReviewed n16:published
dcterms:publisher
n13:ext-af0a9a5baed87c407844a3f5db44597c
bibo:abstract
Based on a critical analysis of data analytics and its foundations, we propose a functional approach to estimate data ensemble properties, which is based entirely on the empirical observations of discrete data samples and the relative proximity of these points in the data space and hence named empirical data analysis (EDA). The ensemble functions include the nonparametric square centrality (a measure of closeness used in graph theory) and typicality (an empirically derived quantity which resembles probability). A distinctive feature of the proposed new functional approach to data analysis is that it does not assume randomness or determinism of the empirically observed data, nor independence. The typicality is derived from the discrete data directly in contrast to the traditional approach, where a continuous probability density function is assumed a priori. The typicality is expressed in a closed analytical form that can be calculated recursively and, thus, is computationally very efficient. The proposed nonparametric estimators of the ensemble properties of the data can also be interpreted as a discrete form of the information potential (known from the information theoretic learning theory as well as the Parzen windows). Therefore, EDA is very suitable for the current move to a data-rich environment, where the understanding of the underlying phenomena behind the available vast amounts of data is often not clear. We also present an extension of EDA for inference. The areas of applications of the new methodology of the EDA are wide because it concerns the very foundation of data analysis. Preliminary tests show its good performance in comparison to traditional techniques.
dcterms:isPartOf
n5:ext-21682267 n7:repository
dcterms:subject
n15:QA
bibo:authorList
n14:authors
bibo:issue
10
bibo:volume
48