﻿<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
	<front>
		<journal-meta>
			<journal-id journal-id-type="publisher-id">INFORMATICA</journal-id>
			<journal-title-group>
				<journal-title>Informatica</journal-title>
			</journal-title-group>
			<issn pub-type="epub">1822-8844</issn>
			<issn pub-type="ppub">0868-4952</issn>
			<issn-l>0868-4952</issn-l>
			<publisher>
				<publisher-name>Vilnius University Institute of Mathematics and Informatics</publisher-name>
				<publisher-loc>Akademijos 4, LT-08663 Vilnius, Lithuania</publisher-loc>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="publisher-id">INFO585</article-id>
			<article-id pub-id-type="doi">10.15388/Informatica.2005.084</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Research Article</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>Multivariate Data Clustering for the Gaussian Mixture Model</article-title>
			</title-group>
			<contrib-group>
				<contrib contrib-type="author">
					<name>
						<surname>Kavaliauskas</surname>
						<given-names>Mindaugas</given-names>
					</name>
					<email xlink:href="mailto:snaiperiui@takas.lt">snaiperiui@takas.lt</email>
					<xref ref-type="aff" rid="j_info585_aff_001">a</xref>
				</contrib>
				<contrib contrib-type="author">
					<name>
						<surname>Rudzkis</surname>
						<given-names>Rimantas</given-names>
					</name>
					<email xlink:href="mailto:rudzkis@ktl.mii.lt">rudzkis@ktl.mii.lt</email>
					<xref ref-type="aff" rid="j_info585_aff_002">b</xref>
				</contrib>
				<aff id="j_info585_aff_001">
					<label>a</label>Faculty of Fundamental Science, <institution>Kaunas University of Technology</institution>, Donelaičio 72, LT-3000 Kaunas, <country>Lithuania</country>
				</aff>
				<aff id="j_info585_aff_002">
					<label>b</label>Department of Applied Statistics, <institution>Institute of Mathematics and Informatics</institution>, Akademijos 4, 08663 Vilnius, <country>Lithuania</country>
				</aff>
			</contrib-group>
			<pub-date pub-type="ppub">
				<year>2005</year>
			</pub-date>
			<volume>16</volume>
			<issue>1</issue>
			<fpage>61</fpage>
			<lpage>74</lpage>
			<history>
				<date date-type="received">
					<day>1</day>
					<month>6</month>
					<year>2004</year>
				</date>
			</history>
			<permissions>
				<copyright-statement>© 2005 Institute of Mathematics and Informatics, Vilnius</copyright-statement>
				<copyright-year>2005</copyright-year>
				<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
					<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p>
				</license>
			</permissions>
			<abstract>
				<p>This paper discusses a soft sample clustering problem for multivariate independent random data satisfying the mixture model of the Gaussian distribution. The theory recommends to estimate the parameters of model by the maximum likelihood method and to use “plug-in” approach for data clustering. Unfortunately, the calculation problem of the maximum likelihood estimate is not completely solved in multivariate case. This work proposes a new constructive a few stage procedure to solve this task. This procedure includes statistical distribution analysis of a large number of the univariate projections of observations, geometric clustering of a multivariate sample and application of EM algorithm. The results of the accuracy analysis of the proposed methods is made by means of Monte-Carlo simulation.</p>
			</abstract>
			<kwd-group>
				<label>Key words</label>
				<kwd>clustering</kwd>
				<kwd>multivariate data</kwd>
				<kwd>Gaussian mixture model</kwd>
				<kwd>projection-based clustering</kwd>
				<kwd>EM algorithm</kwd>
			</kwd-group>
		</article-meta>
	</front>
</article>