SelvarClust (Learning)

Variable selection in model-based clustering.

SelvarClust implements the greedy algorithm associated with the SR model proposed by C. Maugis, G. Celeux and M.-L. Martin-Magniette in [1] and [2], which modifies the method of Raftery and Dean [3].

The software handles data in which individuals are described by quantitative block variables. It returns a clustering of the data and the selected model, which consists of the number of clusters, the mixture form and the variable partition.

Keywords: Bayes factor; BIC; Linear regression; Model-based clustering; Variable selection

Link to the MIA information system item: http://onlinelibrary.wiley.com/enhanced/exportCitation/doi/10.1111/j.1541-0420…

Author(s): Maugis, C.; Celeux, G.; Martin-Magniette, M.-L.

Lead(s)

Unit: MIA-Paris

General information

Status: Available

Tutorial: http://www.math.univ-toulouse.fr/~maugis/image/SoftwaresEnclosures/Description…

Maintenance: Maintained

Specific information

Development language(s): C++

Interface language(s): C++

Supported OS: any

State: Development stopped

Current version number: Not specified

SelvarClustMV (Learning)

Variable selection in model-based clustering, taking into account missing values

SelvarClustMV is a greedy algorithm associated with the SR model proposed in Maugis et al. (Biometrics, 2009), extended to handle missing values. The software handles data in which individuals are described by quantitative block variables. It returns a clustering of the data and the selected model, which consists of the number of clusters and the variable partition. This version is provided only for Gaussian mixtures whose variance matrices are assumed to be identical and free (m=[pkLC]).

Keywords: Variable selection; Missing values; Model-based clustering

Link to the MIA information system item: http://www.math.univ-toulouse.fr/~maugis/SelvarClustMVHomepage.html

Author(s): Maugis-Rabusseau C.; Martin-Magniette M.-L.; Pelletier S.

Lead(s)

Unit: MIA-Paris

Reference publication: http://journal-sfds.fr/ojs/index.php/J-SFdS/article/view/119/109

General information

Status: Available

Maintenance: Not specified

Specific information

Development language(s): C++

Interface language(s): C++

State: Development stopped

Current version number: Not specified

SelvarClustIndep (Learning)

Variable selection in model-based clustering.

SelvarClustIndep is a greedy algorithm associated with the SRUW model proposed by C. Maugis, G. Celeux and M.-L. Martin-Magniette in [1] and [2]. It modifies the method of Raftery and Dean [3] and improves on the authors' SelvarClust algorithm [4]. The SRUW model distinguishes three possible roles for a variable: relevant, redundant or independent.

The software handles datasets in which observations are described by quantitative variables. It returns a clustering of the data and the selected model, which consists of the number of clusters, the mixture form, the form of the variance matrix for the linear regression and for the independent Gaussian density, and the variable partition.

Keywords: Independent Gaussian density

Link to the MIA information system item: http://www.math.univ-toulouse.fr/~maugis/SelvarClustIndepHomepage.html

Lead(s)

Unit: MIA-Paris

Reference publication: http://dx.doi.org/10.1016/j.csda.2009.04.013

General information

Status: Available

Maintenance: Maintained

Specific information

Development language(s): C++

Interface language(s): C++

Supported OS: any

State: Development stopped

Current version number: Not specified

Dbmss (Spatial data & ecology)

Tools to characterize point patterns.

Simple computation of distance-based spatial statistic functions that characterize the spatial structure of mapped objects, including classical ones (Ripley's K and others) and more recent ones used by spatial economists (Duranton and Overman's Kd, Marcon and Puech's M). Relies on spatstat for some core calculations.
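To make the idea of a distance-based function concrete, here is a minimal Python sketch of a naive Ripley's K estimate. It is not the dbmss or spatstat API (both are R packages); the function name `ripley_k` and the toy point set are assumptions made for illustration, and no edge correction is applied, which a real analysis would need. The estimate used is K(r) = A / (n(n-1)) times the number of ordered pairs of distinct points at distance at most r, where A is the area of the observation window.

```python
import math


def ripley_k(points, r, area):
    """Naive Ripley's K estimate at distance r (no edge correction)."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    # Count ordered pairs (i, j), i != j, whose distance is at most r.
    close_pairs = sum(
        1
        for i, p in enumerate(points)
        for j, q in enumerate(points)
        if i != j and math.dist(p, q) <= r
    )
    return area * close_pairs / (n * (n - 1))


# Hypothetical toy pattern: the four corners of a unit square.
corners = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
k_values = {r: ripley_k(corners, r, area=1.0) for r in (0.5, 1.0, 1.5)}
```

Under complete spatial randomness the theoretical value is K(r) = πr², so an estimate well above that curve suggests clustering at scale r and one well below suggests dispersion.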

Keywords: Spatial structure; point patterns

Link to the MIA information system item: https://cran.r-project.org/package=dbmss

Author(s): Eric Marcon; Gabriel Lang; Stephane Traissac; Florence Puech

Contact: Eric.Marcon@ecofog.gf

Lead(s)

Unit: MIA-Paris

Reference publication: http://dx.doi.org/10.18637/jss.v067.c03

General information

Status: Available

Maintenance: Maintained

Specific information

Development language(s): R

Interface language(s): R

Current version number: V2.2-5

Current version date: 2016-06-29

Supported OS: any

License type: GPLv2

State: Development stopped

AR1seg

Implementation of the robust approach for estimating change-points in the mean of an AR(1) Gaussian process

This package implements a robust approach for estimating change-points in the mean of an AR(1) Gaussian process, following the methodology described in the paper arXiv 1403.1958.
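The following Python sketch illustrates what a robust estimate of the AR(1) coefficient can look like when the mean has change-points. It is an assumption about the kind of estimator meant here, not the AR1seg API, and the exact formula should be checked against arXiv 1403.1958. It relies on the identity Var(y[i+2] - y[i]) / Var(y[i+1] - y[i]) = 1 + rho for a stationary AR(1) process, and replaces standard deviations by medians of absolute differences, which are barely affected by a few jumps in the mean. The names `robust_ar1_estimate` and `simulate_ar1_with_shifts` are hypothetical.

```python
import random
import statistics


def robust_ar1_estimate(y):
    """Median-based estimate of the AR(1) coefficient (assumed form)."""
    lag1 = statistics.median(abs(b - a) for a, b in zip(y, y[1:]))
    lag2 = statistics.median(abs(b - a) for a, b in zip(y, y[2:]))
    # The squared ratio of medians estimates 1 + rho.
    return (lag2 / lag1) ** 2 - 1.0


def simulate_ar1_with_shifts(n, rho, means, seed=0):
    """AR(1) Gaussian noise plus a piecewise-constant mean."""
    rng = random.Random(seed)
    segment_length = n // len(means)
    noise = 0.0
    series = []
    for i in range(n):
        noise = rho * noise + rng.gauss(0.0, 1.0)
        level = means[min(i // segment_length, len(means) - 1)]
        series.append(level + noise)
    return series


# Simulated data with three change-points in the mean and rho = 0.5.
y = simulate_ar1_with_shifts(100_000, rho=0.5, means=[0.0, 3.0, -2.0, 1.0])
rho_hat = robust_ar1_estimate(y)
```

Once the coefficient is estimated, the series can be decorrelated (for example by taking y[i] - rho_hat * y[i-1]) before applying a standard change-point method to the result.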

Keywords: Auto-regressive model; Change-points; Robust estimation of the AR(1) parameter; Time series; Model selection

Link to the MIA information system item: https://cran.r-project.org/web/packages/AR1seg/index.html

Author(s): S. Chakar; E. Lebarbier; C. Levy-Leduc; S. Robin

Contact: souhil.chakar@agroparistech.fr

Lead(s)

Unit: MIA-Paris

Reference publication: http://arxiv.org/abs/1403.1958

General information

Status: Available

Reference manual: https://cran.r-project.org/web/packages/AR1seg/AR1seg.pdf

Maintenance: Maintained

Specific information

Development language(s): R

Interface language(s): R

Current version number: V1.0

Current version date: 2014-06-05

Supported OS: any

License type: GPLv2

State: Development stopped

HiCseg

Two-dimensional segmentation for analyzing Hi-C data

Motivation: The spatial conformation of the chromosome has a deep influence on gene regulation and expression. Hi-C technology allows the evaluation of the spatial proximity between any pair of loci along the genome. It results in a data matrix where blocks corresponding to (self-)interacting regions appear. The delimitation of such blocks is critical to better understand the spatial organization of the chromatin. From a computational point of view, it results in a 2D segmentation problem.

Results: We focus on the detection of cis-interacting regions, which appear to be prominent in observed data. We define a block-wise segmentation model for the detection of such regions. We prove that the maximization of the likelihood with respect to the block boundaries can be rephrased in terms of a 1D segmentation problem, for which the standard dynamic programming applies. The performance of the proposed methods is assessed by a simulation study on both synthetic and resampled data. A comparative study on public data shows good concordance with biologically confirmed regions.

Availability and implementation: The HiCseg R package is available from the Comprehensive R Archive Network and from the Web page of the corresponding author.
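The "standard dynamic programming" for 1D segmentation mentioned above can be sketched as follows in Python. This is a generic illustration, not the HiCseg implementation or its likelihood: the squared-error cost, the function name `segment_1d` and the toy signal are assumptions. The recursion computes, for each number of segments and each end position, the best cost of segmenting the prefix, then backtracks to recover the change-points.

```python
def segment_1d(y, k):
    """Split y into k contiguous segments minimizing within-segment squared error.

    Returns (change_points, total_cost), where change_points are the start
    indices of segments 2..k.
    """
    n = len(y)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and len(y)")
    # Prefix sums give the cost of any segment y[i:j] in constant time.
    s1, s2 = [0.0], [0.0]
    for v in y:
        s1.append(s1[-1] + v)
        s2.append(s2[-1] + v * v)

    def cost(i, j):
        total = s1[j] - s1[i]
        return (s2[j] - s2[i]) - total * total / (j - i)

    inf = float("inf")
    # best[s][j]: minimal cost of splitting y[:j] into s segments.
    best = [[inf] * (n + 1) for _ in range(k + 1)]
    arg = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0
    for s in range(1, k + 1):
        for j in range(s, n + 1):
            for i in range(s - 1, j):
                c = best[s - 1][i] + cost(i, j)
                if c < best[s][j]:
                    best[s][j] = c
                    arg[s][j] = i
    # Backtrack from the full sequence to recover segment starts.
    starts, j = [], n
    for s in range(k, 0, -1):
        j = arg[s][j]
        starts.append(j)
    starts.reverse()
    return starts[1:], best[k][n]


signal = [0, 0, 0, 5, 5, 5, 1, 1]
change_points, total_cost = segment_1d(signal, 3)
```

The triple loop costs O(k n²) operations, which is why reducing the 2D block problem to a 1D one matters in practice.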

Keywords: Spatial proximity; Genome; Spatial organization

Link to the MIA information system item: https://cran.r-project.org/web/packages/HiCseg/index.html

Contact: celine.levy-leduc@agroparistech.fr

Lead(s)

Unit: MIA-Paris

Reference publication: https://doi.org/10.1093/bioinformatics/btu443

General information

Status: Available

Maintenance: Maintained

Specific information

Development language(s): R

Interface language(s): R

Current version number: V1.1

Current version date: 2014-06-10

Supported OS: any

License type: GPLv2

State: Development stopped

Blockseg

Segments a matrix in blocks with constant values.

Detection of correlated regions in expression data, taking copy number variations into account.

Keywords: Organization

Link to the MIA information system item: https://cran.r-project.org/web/packages/blockseg/index.html

Author(s): Julien Chiquet; Vincent Brault

Contact: julien.chiquet@gmail.com

Lead(s)

Unit: MIA-Paris

General information

Status: Available

Reference manual: https://cran.r-project.org/web/packages/blockseg/blockseg.pdf

Maintenance: Maintained

Specific information

Development language(s): R

Interface language(s): R

Current version number: V0.2

Current version date: 2016-06-10

Supported OS: any

License type: GPLv2

State: Development stopped

SegCorr

Detection of correlated regions in expression data, taking copy number variations into account.

Performs correlation matrix segmentation and applies a test procedure to detect highly correlated regions in gene expression.

Keywords: analysis of variance

Link to the MIA information system item: https://cran.r-project.org/web/packages/SegCorr/index.html

Author(s): Eleni Ioanna Delatola; Emilie Lebarbier; Tristan Mary-Huard; Francois Radvanyi; Stephane Robin; Jennifer Wong

Contact: eldelatola@yahoo.gr

Lead(s)

Unit: MIA-Paris

General information

Status: Available

Reference manual: https://cran.r-project.org/web/packages/SegCorr/SegCorr.pdf

Maintenance: Maintained

Specific information

Development language(s): R

Interface language(s): R

Current version number: V1.1

Current version date: 2015-11-04

Supported OS: any

License type: GPLv2

State: Development stopped

MixThres

MixThres is a package for defining a hybridization threshold from mixture models fitted to the distribution of a signal.

Although one of the major applications of two-color DNA microarray hybridizations is to detect differentially expressed genes using intensity log-ratios, single-channel signals also provide useful information as absolute measurements that allow gene expression patterns to be described. In this context, it becomes crucial to determine the set of probes that hybridize, that is, those for which the intensity signal is greater than a hybridization threshold to be fixed. Existing procedures either use an arbitrary threshold or require knowledge of a population of non-hybridized probes. In this work we present the MixThres method to determine an adaptive hybridization threshold from the intensity levels of the complete set of probes hybridized on a chip. We define a hybridization threshold based on the histogram of the probe intensity values. Our procedure is divided into two steps. First, the intensity distribution is estimated using mixture models. Second, a hybridization threshold is defined from the components of the mixture. We validate our method on DNA tiling array and expression array data. We show that our method has good reproducibility, that its specificity is greater than 97% and that its precision is 88%. The R package MixThres is available at http://www.agroparistech.fr/mia/outil.htm
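The two-step procedure described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the MixThres package (which is written in R): it assumes exactly two Gaussian components, fits them with a plain EM loop, and takes as threshold the intensity at which the posterior probability of the higher-mean component reaches 0.5. The threshold rule, the function names and the simulated intensities are all hypothetical; the source does not state how MixThres derives the threshold from the mixture components.

```python
import math
import random


def normal_pdf(x, mu, sd):
    z = (x - mu) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def posterior_high(x, w, mu, sd):
    """Posterior probability that x belongs to the higher-mean component."""
    p0 = w[0] * normal_pdf(x, mu[0], sd[0])
    p1 = w[1] * normal_pdf(x, mu[1], sd[1])
    return p1 / (p0 + p1)


def fit_two_gaussians(xs, n_iter=200):
    """Step 1: EM for a two-component 1D Gaussian mixture."""
    n = len(xs)
    centre = sum(xs) / n
    spread = math.sqrt(sum((x - centre) ** 2 for x in xs) / n)
    w, mu, sd = [0.5, 0.5], [min(xs), max(xs)], [spread, spread]
    for _ in range(n_iter):
        # E-step: responsibilities of the higher-mean component.
        resp = [posterior_high(x, w, mu, sd) for x in xs]
        # M-step: update weights, means and standard deviations.
        n1 = sum(resp)
        n0 = n - n1
        mu = [sum((1 - r) * x for r, x in zip(resp, xs)) / n0,
              sum(r * x for r, x in zip(resp, xs)) / n1]
        var0 = sum((1 - r) * (x - mu[0]) ** 2 for r, x in zip(resp, xs)) / n0
        var1 = sum(r * (x - mu[1]) ** 2 for r, x in zip(resp, xs)) / n1
        sd = [max(math.sqrt(var0), 1e-6), max(math.sqrt(var1), 1e-6)]
        w = [n0 / n, n1 / n]
    return w, mu, sd


def hybridization_threshold(w, mu, sd):
    """Step 2 (assumed rule): bisection for the 0.5 posterior crossing."""
    lo, hi = mu[0], mu[1]
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if posterior_high(mid, w, mu, sd) < 0.5:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Simulated log-intensities: non-hybridized probes near 6, hybridized near 12.
rng = random.Random(1)
intensities = ([rng.gauss(6.0, 1.0) for _ in range(300)]
               + [rng.gauss(12.0, 1.0) for _ in range(300)])
weights, means, sds = fit_two_gaussians(intensities)
threshold = hybridization_threshold(weights, means, sds)
```

Probes whose intensity exceeds `threshold` would then be called hybridized; with real data the number of components and the decision rule would have to follow the method's reference manual.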

Keywords: Microarray; Gene expression patterns; Hybridization; Mixture Model

Link to the MIA information system item: mixthres_1.0.zip

Author(s): Julie Aubert; Marie-Laure Martin-Magniette

Contact: julie.aubert@agroparistech.fr

Lead(s)

Unit: MIA-Paris

Team: Statistique et Génome

Co-lead department: BAP

Reference publication: not specified

General information

External partner: none

Reference manual: MixThres: mixture models to define a hybridization threshold in DNA microarray …

Specific information

Development language(s): R

Current version number: V1.0

Current version date: 2008-10-01

Supported OS: any

License type: GPLv2

MIA scientific information system, sorted by unit (UR, UMR)