Introduction
This web site gives access to on-line demos of programs, written in C++, I have implemented during the last years regarding the so-called Hidden Markov Chain (HMC) model for time-series analysis. Source codes can be downloaded here. Programs are concerned with the unsupervised restoration of recent extensions of HMC models, e.g. Noise-independent HMC (the classical model as described by L. R. Rabiner), Pairwise Markov Chain (including HMC with correlated noise)...
Make your choice on the left menu, set arguments (number of classes, number of iterations, data-driven pdf...) and provide your data file to get, in return, a file with classified data. For each demo program, a data file example is provided for testing purposes.
Most of programs have been developed from the work and with the collaboration of Prof. Wojciech Pieczynski. Bibliographical references regarding the underlying models (and many more!) can be found in its web pages. You can also check for draft papers on my personal pages.
Stéphane Derrode
K-means classification algorithm
Short description This is the classical kmeans classification algorithm. You can set vectorial data. Examples of data file are provided below.
Form
Help on parameters
| -Y | File names containing observations; If several files are set : all file must have the same number of samples. |
| -e | Number of iterations (>0); classical value: 30 |
| -K | Class number (>1); classical value: 2 |
Data files for testing
- Y_BLIND.txt: file of 1000 data generated according to a mixture model.
- fleur.txt: Peano-decomposed text file of the fleur.pgm gray-scale image (dim: 250x320).
- fleurR.txt, fleurG.txt, fleurB.txt: Peano-decomposed text file of the three bands of the fleur.jpg color image (dim: 250x320). Bands in pgm format are: fleurR.pgm, fleurG.pgm, fleurB.pgm
Mixture model for iid data
Short description This is the classical mixture model for iid data. You can set vectorial data. Examples of data file are provided below.
Form
Help on parameters
| -Y | File names containing observations; If several files are set : (i) all file must have the same number of samples; (ii) Gaussian copulas for multi-dimensionnal data-driven densities are assumed. | ||||||||||||||
| -E | Estimation method [EM/SEM/ICE]; | ||||||||||||||
| -e | Number of iterations (0=no iter=kmeans only); classical value: 300 | ||||||||||||||
| -K | Class number (>1); classical value: 2 | ||||||||||||||
| -l | Data driven density types
|
Data files for testing
- Y_BLIND.txt: file of 1000 data generated according to a mixture model.
- fleur.txt: Peano-decomposed text file of the fleur.pgm gray-scale image (dim: 250x320).
- fleurR.txt, fleurG.txt, fleurB.txt: Peano-decomposed text file of the three bands of the fleur.jpg color image (dim: 250x320). Bands in pgm format are: fleurR.pgm, fleurG.pgm, fleurB.pgm
Multiscale or Joint mixture model (JMM - PMM) for iid data
Short description This is the multiscale mixture model (or mixture of mixture, or joint mixture model) for iid data. You can set vectorial data. Examples of data file are provided below.
Form
Help on parameters
| -Y | File names containing observations; If several files are set : (i) all file must have the same number of samples; (ii) Gaussian copulas for multi-dimensionnal data-driven densities are assumed. | ||||||||||||||||||
| -E | Estimation method [EM/SEM/ICE]; | ||||||||||||||||||
| -e | Number of iterations (0=no iter=kmeans only); classical value: 300 | ||||||||||||||||||
| -K | Multiscale and class numbers; for example 2:3 means 2 scales and two classes for the upper scale, and three classes for the lower. | ||||||||||||||||||
| -c | Copula type
|
||||||||||||||||||
| -l | Data driven density types
|
||||||||||||||||||
| -p | Data driven density types
|
Data files for testing
- Y_BLIND.txt: file of 1000 data generated according to a mixture model.
- fleur.txt: Peano-decomposed text file of the fleur.pgm gray-scale image (dim: 250x320).
- fleurR.txt, fleurG.txt, fleurB.txt: Peano-decomposed text file of the three bands of the fleur.jpg color image (dim: 250x320). Bands in pgm format are: fleurR.pgm, fleurG.pgm, fleurB.pgm
HMC with independent noise
Short description This is the clasical HMC-IN (independent noise) model. You can set vectorial data. Examples of data file are provided below. Also the variant below.
Form
Help on parameters
| -Y | File names containing observations; If several files are set : (i) all file must have the same number of samples; (ii) Gaussian copulas for multi-dimensionnal data-driven densities are assumed. | ||||||||||||||
| -E | Estimation method [EM/SEM/ICE]; | ||||||||||||||
| -e | Number of iterations (0=no iter=kmeans only); classical value: 50 | ||||||||||||||
| -K | Class number (>1); classical value: 2 | ||||||||||||||
| -l | Data driven density types
|
Data files for testing
- Y_HMC-IN.txt: file of 1000 data generated according to a HMC-IN model.
- fleur.txt: Peano-decomposed text file of the fleur.pgm gray-scale image (dim: 250x320).
- fleurR.txt, fleurG.txt, fleurB.txt: Peano-decomposed text file of the three bands of the fleur.jpg color image (dim: 250x320). Bands in pgm format are: fleurR.pgm, fleurG.pgm, fleurB.pgm
HMC with independent noise (local estimation)
Short description This variant uses a bootstrap strategy to estimate HMC parameters from a subsample. This model is of special interest for very huge data set. But, due to server limitation, you can only test this program for not too big datafile (less than 500000 bytes). So, the goal of including this demo here is to check that bootstrap estimation is robust and gives very similar compared to the classical estimation algorithm.
Form
Help on parameters
| -Y | File name containing observations; | ||||||||||||||
| -E | Estimation method [EM/SEM/ICE]; | ||||||||||||||
| -e | Number of iterations (0=no iter=kmeans only); classical value: 50 | ||||||||||||||
| -K | Class number (>1); classical value: 2 | ||||||||||||||
| -l | Data driven density types
|
||||||||||||||
| -s | Local size support >0; classical value: 5 | ||||||||||||||
| -b | Sub-sample type (0:all or 3:bootstrap); classical value: 3 |
HMC with dependent noise
Short description This is the non-clasical HMC-DN (dependant noise) model (particular case of PMC model for which th state process is also markovian). Copulas are used to model the bi-dimensional data driven densities.
Form
Help on parameters
| -Y | File name containing observations; | ||||||||||||||
| -E | Estimation method [SEM/ICE]; | ||||||||||||||
| -e | Number of iterations (0=no iter=kmeans only); classical value: 50 | ||||||||||||||
| -K | Class number (>1); classical value: 2 | ||||||||||||||
| -l | Data driven density types
|
||||||||||||||
| -c | List of 4 copulas, each to choose in [0,8] (0:product; 1:Gaussian; 2:Student; 3:Gumbel; 4:Farlie-Gumbel-Morgenstern; 5:Cubic; 6: Clayton; 7:A12; 8:A14); classical value: 1:1:1:1 |
Data files for testing
- Y_HMC-DN.txt: file of 1000 data generated according to a HMC-DN model.
- fleur.txt: Peano-decomposed text file of the fleur.pgm gray-scale image (dim: 250x320).
- fleurR.txt, fleurG.txt, fleurB.txt: Peano-decomposed text file of the three bands of the fleur.jpg color image (dim: 250x320). Bands in pgm format are: fleurR.pgm, fleurG.pgm, fleurB.pgm
Pairwise Markov Chain
Short description This is the general PMC model with copulas.
Form
Help on parameters
| -E | Estimation method [SEM/ICE]; | |||||||||
| -Y | File name containing observations; | |||||||||
| -e | Number of iterations (0=no iter=kmeans only); classical value: 50 | |||||||||
| -K | Class number (>1); classical value: 2 | |||||||||
| -l | Data driven density types (0=Gaussian; 1=Gamma; 2=8_2_7 in Pearson system; 3=2+1; 4=all Pearson); classical value for "-K 2" classes: 0:0:0:0 (four gaussians) | |||||||||
| -c | List of K*K copulas, separated by ':', each to choose in [0,8]
The choise for the K*K copulas is not completely free. For example, if K=3, due to some symmetry condition, we must write "c1:c2:c3:c2:c4:c5:c3:c5:c6", which corresponds to:
|
Data files for testing
- Y_PMC.txt: file of 1000 data generated according to a PMC model.
- fleur.txt: Peano-decomposed text file of the fleur.pgm gray-scale image (dim: 250x320).
- fleurR.txt, fleurG.txt, fleurB.txt: Peano-decomposed text file of the three bands of the fleur.jpg color image (dim: 250x320). Bands in pgm format are: fleurR.pgm, fleurG.pgm, fleurB.pgm
Direct Peano scan of a pgm image
Short description The Hilbert-Peano scan demo allow to signalize your image (i.e. 2D->1D). The inverse scan (see below) allows to reconstruct the classified signal (i.e. 1D->2D).
Form
Help on parameters
| -Y | Image filename (pgm format only - odd dimensions. Ex: 308x204); |
Image files for testing
- Lena128.pgm: classical gray-scal image of Lena.
- fleur.pgm: gray-scal image of a fower.
Inverse Peano scan of a txt file
Image files for testing
- Lena128.txt: classical gray-scal image of Lena.
- fleur.txt: gray-scal image of a fower.
Help on parameters
| -J | Txt filename containing a Peano scan to be image-reconstructed; |
| -D | Dimensions of the image to be recomposed (#raw,#column) |
Perform all classifications from one data file
Short description This demo classifies a data file with all available demos.
Form
Help on parameters
| -Y | File name containing observations; |
| -E | Estimation method [SEM/ICE]; |
| -K | Class number (>1); classical value: 2 |
Last classification results
Impossible!Impossible!