All Publications | Signal Processing Group

2016

Valera, Isabel; Ruiz, Francisco J R; Perez-Cruz, Fernando

Infinite Factorial Unbounded-State Hidden Markov Model Artículo de revista

En: IEEE transactions on pattern analysis and machine intelligence, vol. 38, no 9, pp. 1816 – 1828, 2016, ISSN: 1939-3539.

Resumen | Enlaces | BibTeX | Etiquetas: Bayes methods, Bayesian nonparametrics, CASI CAM CM, Computational modeling, GAMMA-L+ UC3M, Gibbs sampling, Hidden Markov models, Inference algorithms, Journal, Markov processes, Probability distribution, reversible jump Markov chain Monte Carlo, slice sampling, Time series, variational inference, Yttrium

@article{Valera2016b,

title = {Infinite Factorial Unbounded-State Hidden Markov Model},

author = {Isabel Valera and Francisco J R Ruiz and Fernando Perez-Cruz},

url = {http://www.ncbi.nlm.nih.gov/pubmed/26571511 http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true\&amp;arnumber=7322279},

doi = {10.1109/TPAMI.2015.2498931},

issn = {1939-3539},

year  = {2016},

date = {2016-09-01},

journal = {IEEE transactions on pattern analysis and machine intelligence},

volume = {38},

number = {9},

pages = {1816 -- 1828},

abstract = {There are many scenarios in artificial intelligence, signal processing or medicine, in which a temporal sequence consists of several unknown overlapping independent causes, and we are interested in accurately recovering those canonical causes. Factorial hidden Markov models (FHMMs) present the versatility to provide a good fit to these scenarios. However, in some scenarios, the number of causes or the number of states of the FHMM cannot be known or limited a priori. In this paper, we propose an infinite factorial unbounded-state hidden Markov model (IFUHMM), in which the number of parallel hidden Markov models (HMMs) and states in each HMM are potentially unbounded. We rely on a Bayesian nonparametric (BNP) prior over integer-valued matrices, in which the columns represent the Markov chains, the rows the time indexes, and the integers the state for each chain and time instant. First, we extend the existent infinite factorial binary-state HMM to allow for any number of states. Then, we modify this model to allow for an unbounded number of states and derive an MCMC-based inference algorithm that properly deals with the trade-off between the unbounded number of states and chains. We illustrate the performance of our proposed models in the power disaggregation problem.},

keywords = {Bayes methods, Bayesian nonparametrics, CASI CAM CM, Computational modeling, GAMMA-L+ UC3M, Gibbs sampling, Hidden Markov models, Inference algorithms, Journal, Markov processes, Probability distribution, reversible jump Markov chain Monte Carlo, slice sampling, Time series, variational inference, Yttrium},

pubstate = {published},

tppubtype = {article}

}

Cerrar

Valera, Isabel; Ruiz, Francisco J R; Perez-Cruz, Fernando

Infinite Factorial Unbounded-State Hidden Markov Model Artículo de revista

En: IEEE transactions on pattern analysis and machine intelligence, vol. To appear, no 99, pp. 1, 2016, ISSN: 1939-3539.

Resumen | Enlaces | BibTeX | Etiquetas: Bayes methods, Bayesian nonparametrics, CASI CAM CM, Computational modeling, GAMMA-L+ UC3M, Gibbs sampling, Hidden Markov models, Inference algorithms, Markov processes, Probability distribution, reversible jump Markov chain Monte Carlo, slice sampling, Time series, variational inference, Yttrium

@article{Valera2016c,

title = {Infinite Factorial Unbounded-State Hidden Markov Model},

author = {Isabel Valera and Francisco J R Ruiz and Fernando Perez-Cruz},

url = {http://www.ncbi.nlm.nih.gov/pubmed/26571511 http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true\&amp;arnumber=7322279},

doi = {10.1109/TPAMI.2015.2498931},

issn = {1939-3539},

year  = {2016},

date = {2016-01-01},

journal = {IEEE transactions on pattern analysis and machine intelligence},

volume = {To appear},

number = {99},

pages = {1},

abstract = {There are many scenarios in artificial intelligence, signal processing or medicine, in which a temporal sequence consists of several unknown overlapping independent causes, and we are interested in accurately recovering those canonical causes. Factorial hidden Markov models (FHMMs) present the versatility to provide a good fit to these scenarios. However, in some scenarios, the number of causes or the number of states of the FHMM cannot be known or limited a priori. In this paper, we propose an infinite factorial unbounded-state hidden Markov model (IFUHMM), in which the number of parallel hidden Markov models (HMMs) and states in each HMM are potentially unbounded. We rely on a Bayesian nonparametric (BNP) prior over integer-valued matrices, in which the columns represent the Markov chains, the rows the time indexes, and the integers the state for each chain and time instant. First, we extend the existent infinite factorial binary-state HMM to allow for any number of states. Then, we modify this model to allow for an unbounded number of states and derive an MCMC-based inference algorithm that properly deals with the trade-off between the unbounded number of states and chains. We illustrate the performance of our proposed models in the power disaggregation problem.},

keywords = {Bayes methods, Bayesian nonparametrics, CASI CAM CM, Computational modeling, GAMMA-L+ UC3M, Gibbs sampling, Hidden Markov models, Inference algorithms, Markov processes, Probability distribution, reversible jump Markov chain Monte Carlo, slice sampling, Time series, variational inference, Yttrium},

pubstate = {published},

tppubtype = {article}

}

Cerrar

Borchani, Hanen; Larrañaga, Pedro; Gama, J; Bielza, Concha

Mining multi-dimensional concept-drifting data streams using Bayesian network classifiers Artículo de revista

En: Intelligent Data Analysis, vol. 20, 2016.

Enlaces | BibTeX | Etiquetas: CASI CAM CM, CIG UPM, Journal

2015

Valera, Isabel; Ruiz, Francisco J R; Svensson, Lennart; Perez-Cruz, Fernando

Infinite Factorial Dynamical Model Proceedings Article

En: Advances in Neural Information Processing Systems, pp. 1657–1665, Montreal, 2015.

Resumen | Enlaces | BibTeX | Etiquetas: CASI CAM CM, GAMMA-L+ UC3M

Mihaljević, Bojan; Benavides-Piccione, Ruth; Guerra, Luis; DeFelipe, Javier; Larrañaga, Pedro; Bielza, Concha

Classifying GABAergic interneurons with semi-supervised projected model-based clustering. Artículo de revista

En: Artificial intelligence in medicine, vol. 65, no 1, pp. 49–59, 2015, ISSN: 1873-2860.

Resumen | Enlaces | BibTeX | Etiquetas: Automatic neuron classification, CASI CAM CM, Cerebral cortex, CIG UPM, Gaussian mixture models, Journal, Semi-supervised projected clustering

@article{Mihaljevic2015,

title = {Classifying GABAergic interneurons with semi-supervised projected model-based clustering.},

author = {Bojan Mihaljevi\'{c} and Ruth Benavides-Piccione and Luis Guerra and Javier DeFelipe and Pedro Larra\~{n}aga and Concha Bielza},

url = {http://www.aiimjournal.com/article/S0933365714001481/fulltext http://cig.fi.upm.es/articles/2015/Mihaljevic-2015-AIIM.pdf},

doi = {10.1016/j.artmed.2014.12.010},

issn = {1873-2860},

year  = {2015},

date = {2015-09-01},

journal = {Artificial intelligence in medicine},

volume = {65},

number = {1},

pages = {49--59},

publisher = {Elsevier},

abstract = {OBJECTIVES: A recently introduced pragmatic scheme promises to be a useful catalog of interneuron names. We sought to automatically classify digitally reconstructed interneuronal morphologies according to this scheme. Simultaneously, we sought to discover possible subtypes of these types that might emerge during automatic classification (clustering). We also investigated which morphometric properties were most relevant for this classification. MATERIALS AND METHODS: A set of 118 digitally reconstructed interneuronal morphologies classified into the common basket (CB), horse-tail (HT), large basket (LB), and Martinotti (MA) interneuron types by 42 of the world's leading neuroscientists, quantified by five simple morphometric properties of the axon and four of the dendrites. We labeled each neuron with the type most commonly assigned to it by the experts. We then removed this class information for each type separately, and applied semi-supervised clustering to those cells (keeping the others' cluster membership fixed), to assess separation from other types and look for the formation of new groups (subtypes). We performed this same experiment unlabeling the cells of two types at a time, and of half the cells of a single type at a time. The clustering model is a finite mixture of Gaussians which we adapted for the estimation of local (per-cluster) feature relevance. We performed the described experiments on three different subsets of the data, formed according to how many experts agreed on type membership: at least 18 experts (the full data set), at least 21 (73 neurons), and at least 26 (47 neurons). RESULTS: Interneurons with more reliable type labels were classified more accurately. We classified HT cells with 100% accuracy, MA cells with 73% accuracy, and CB and LB cells with 56% and 58% accuracy, respectively. We identified three subtypes of the MA type, one subtype of CB and LB types each, and no subtypes of HT (it was a single, homogeneous type). We got maximum (adapted) Silhouette width and ARI values of 1, 0.83, 0.79, and 0.42, when unlabeling the HT, CB, LB, and MA types, respectively, confirming the quality of the formed cluster solutions. The subtypes identified when unlabeling a single type also emerged when unlabeling two types at a time, confirming their validity. Axonal morphometric properties were more relevant that dendritic ones, with the axonal polar histogram length in the [$pi$, 2$pi$) angle interval being particularly useful. CONCLUSIONS: The applied semi-supervised clustering method can accurately discriminate among CB, HT, LB, and MA interneuron types while discovering potential subtypes, and is therefore useful for neuronal classification. The discovery of potential subtypes suggests that some of these types are more heterogeneous that previously thought. Finally, axonal variables seem to be more relevant than dendritic ones for distinguishing among the CB, HT, LB, and MA interneuron types.},

keywords = {Automatic neuron classification, CASI CAM CM, Cerebral cortex, CIG UPM, Gaussian mixture models, Journal, Semi-supervised projected clustering},

pubstate = {published},

tppubtype = {article}

}

Cerrar

OBJECTIVES: A recently introduced pragmatic scheme promises to be a useful catalog of interneuron names. We sought to automatically classify digitally reconstructed interneuronal morphologies according to this scheme. Simultaneously, we sought to discover possible subtypes of these types that might emerge during automatic classification (clustering). We also investigated which morphometric properties were most relevant for this classification. MATERIALS AND METHODS: A set of 118 digitally reconstructed interneuronal morphologies classified into the common basket (CB), horse-tail (HT), large basket (LB), and Martinotti (MA) interneuron types by 42 of the world's leading neuroscientists, quantified by five simple morphometric properties of the axon and four of the dendrites. We labeled each neuron with the type most commonly assigned to it by the experts. We then removed this class information for each type separately, and applied semi-supervised clustering to those cells (keeping the others' cluster membership fixed), to assess separation from other types and look for the formation of new groups (subtypes). We performed this same experiment unlabeling the cells of two types at a time, and of half the cells of a single type at a time. The clustering model is a finite mixture of Gaussians which we adapted for the estimation of local (per-cluster) feature relevance. We performed the described experiments on three different subsets of the data, formed according to how many experts agreed on type membership: at least 18 experts (the full data set), at least 21 (73 neurons), and at least 26 (47 neurons). RESULTS: Interneurons with more reliable type labels were classified more accurately. We classified HT cells with 100% accuracy, MA cells with 73% accuracy, and CB and LB cells with 56% and 58% accuracy, respectively. We identified three subtypes of the MA type, one subtype of CB and LB types each, and no subtypes of HT (it was a single, homogeneous type). We got maximum (adapted) Silhouette width and ARI values of 1, 0.83, 0.79, and 0.42, when unlabeling the HT, CB, LB, and MA types, respectively, confirming the quality of the formed cluster solutions. The subtypes identified when unlabeling a single type also emerged when unlabeling two types at a time, confirming their validity. Axonal morphometric properties were more relevant that dendritic ones, with the axonal polar histogram length in the [$pi$, 2$pi$) angle interval being particularly useful. CONCLUSIONS: The applied semi-supervised clustering method can accurately discriminate among CB, HT, LB, and MA interneuron types while discovering potential subtypes, and is therefore useful for neuronal classification. The discovery of potential subtypes suggests that some of these types are more heterogeneous that previously thought. Finally, axonal variables seem to be more relevant than dendritic ones for distinguishing among the CB, HT, LB, and MA interneuron types.

Cerrar

Borchani, Hanen; Varando, Gherardo; Bielza, Concha; Larrañaga, Pedro

A survey on multi-output regression Artículo de revista

En: Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 5, no 5, pp. 216–233, 2015, ISSN: 19424787.

Resumen | Enlaces | BibTeX | Etiquetas: algorithm adaptation methods, CASI CAM CM, CIG UPM, Journal, Multi-output regression, multi-target regression, performance evaluation measure, problem transformation methods

Varando, Gherardo; Bielza, Concha; Larrañaga, Pedro

Decision functions for chain classifiers based on Bayesian networks for multi-label classification Artículo de revista

En: International Journal of Approximate Reasoning, 2015, ISSN: 0888613X.

Resumen | Enlaces | BibTeX | Etiquetas: CASI CAM CM, CIG UPM, Journal

Ruiz, Francisco J R; Perez-Cruz, Fernando

A Generative Model for Predicting Outcomes in College Basketball Artículo de revista

En: Journal of Quantitative Analysis in Sports, vol. 11, no 1 Special Issue, pp. 39–52, 2015, ISSN: 1559-0410.

Resumen | Enlaces | BibTeX | Etiquetas: CASI CAM CM, GAMMA-L+ UC3M, Journal, NCAA tournament, Poisson factorization, Probabilistic modeling, variational inference

Varando, Gherardo; López-Cruz, Pedro L; Nielsen, Thomas D; Larrañaga, Pedro; Bielza, Concha

Conditional Density Approximations with Mixtures of Polynomials Artículo de revista

En: International Journal of Intelligent Systems, vol. 30, no 3, pp. 236–264, 2015, ISSN: 08848173.

Resumen | Enlaces | BibTeX | Etiquetas: CASI CAM CM, CIG UPM, Journal

Ruiz, Francisco J R; Perez-Cruz, Fernando

A Generative Model for Predicting Outcomes in College Basketball Artículo de revista

En: Journal of Quantitative Analysis in Sports, vol. 11, no 1 Special Issue, pp. 39–52, 2015, ISSN: 1559-0410.

Resumen | Enlaces | BibTeX | Etiquetas: CASI CAM CM, GAMMA-L+ UC3M, NCAA tournament, Poisson factorization, Probabilistic modeling, variational inference

Varando, Gherardo; Bielza, Concha; Larrañaga, Pedro

Decision boundary for discrete Bayesian network classifiers Artículo de revista

En: Journal of Machine Learning Research, 2015.

Resumen | Enlaces | BibTeX | Etiquetas: Bayesian networks, CASI CAM CM, CIG UPM, decision boundary, Journal, Lagrange basis, polynomial, supervised classication, threshold function