Derek Anderson, James Keller and I have conducted a back-to-back tutorials on "Computer Vision: A Computational Intelligence Perspective", at WCCI'16, Vancouver, Canada. Thank you for your supports. Here are the slides - [Slide - Part I] [Slide - Part II]

    A picture is worth a thousand words. Not until recently, however, we noticed some success stories in understanding of visual scenes: a model that is capable of to detect/name objects, describe their attributes, and recognize their relationships/interactions. In this paper, we propose a phrase-based hierarchical Long Short-Term Memory (phi-LSTM) model to generate image description. The proposed model encodes sentence as a sequence of combination of phrases and words, instead of a sequence of words alone as in those conventional solutions. The two levels of this model are dedicated to i) learn to generate image relevant noun phrases, and ii) produce appropriate image description from the phrases and other words in the corpus. Adopting the convolutional neural network to learn image features and the LSTM to learn word sequence in a sentence, the proposed model has shown better or competitive results in comparison to the state-of-the-art models on Flickr8k and Flickr30k datasets.

    Crowd segmentation is important in serving as the basis for a wide range of crowd analysis tasks such as density estimation and behavior understanding. However, due to inter-occlusions, perspective distortion, clutter background and random crowd distribution, localizing crowd segments is technically a very challenging task. This paper proposes a novel crowd segmentation framework based on granular computing (GrCS) to enable the problem of crowd segmentation be conceptualized at different levels of granularity, and to map problems into computationally tractable subproblems. It shows that by exploiting the correlation among pixel granules, we are able to aggregate structurally similar pixels into meaningful atomic structure granules. This is useful in outlining natural boundaries between crowd and background (i.e. non-crowd) regions. From the structure granules, we infer the crowd and background regions by granular information classification. GrCS is scene-independent, and can be applied effectively to crowd scenes with a variety of physical layout and crowdedness. Extensive experiments have been conducted on hundreds of real and synthetic crowd scenes. The results demonstrate that by exploiting the correlation among granules, we can outline the natural boundaries of structurally similar crowd and background regions necessary for crowd segmentation.

    Although the traits emerged in a mass gathering are often non-deliberative, the act of mass impulse may lead to irrevocable crowd disasters. The two-fold increase of carnage in crowd since the past two decades has spurred significant advances in the field of computer vision, towards effective and proactive crowd surveillance. Computer vision studies related to crowd are observed to resonate with the understanding of the emergent behavior in physics (complex systems) and biology (animal swarm). These studies, which are inspired by biology and physics, share surprisingly common insights, and interesting contradictions. However, this aspect of discussion has not been fully explored. Therefore, this survey provides the readers with a review of the state-of-the-art methods in crowd behavior analysis from the physics and biologically inspired perspectives. We provide insights and comprehensive discussions for a broader understanding of the underlying prospect of blending physics and biology studies in computer vision.

    C.H. Lim, E. Vats & C.S. Chan, Fuzzy Human Motion Analysis: A Review, Pattern Recognition, 2015

        First fuzzy review paper in human motion analysis with 252 references

        PDF

    Human Motion Analysis (HMA) is currently one of the most popularly active research domains as such significant research interests are motivated by a number of real world applications such as video surveillance, sports analysis, healthcare monitoring and so on. However, most of these real world applications face high levels of uncertainties that can affect the operations of such applications. Hence, the fuzzy set theory has been applied and showed great success in the recent past. In this paper, we aim at reviewing the fuzzy set oriented approaches for HMA, individuating how the fuzzy set may improve the HMA, envisaging and delineating the future perspectives. To the best of our knowledge, there is not found a single survey in the current literature that has discussed and reviewed fuzzy approaches towards the HMA. For ease of understanding, we conceptually classify the human motion into three broad levels: Low-Level (LoL), Mid-Level (MiL), and High-Level (HiL) HMA.

August 2016:
I will be giving a TEDx talk on the 29 October 2016. (Youtube video teaser)
August 2016:
One(1) paper accepted in ACCV'16, Taipei, Taiwan.
July 2016:
One(1) paper accepted in ICPR'16, Cancun, Mexico.
May 2016:
One(1) paper accepted in ICIP'16, Phoenix, USA.
March 2016:
Two(2) papers accepted in FUZZ-IEEE'16, Vancouver, Canada.
March 2016:
Paper accepted in IEEE Transactions on Fuzzy Systems.

In this paper, a new formula of fuzzy Caputo fractional-order derivatives (0 < v ≤ 1) in terms of shifted Chebyshev polynomials is derived. The proposed approach introduces shifted Chebyshev operational matrix in combination with shifted Chebyshev tau technique for the numerical solution of linear fuzzy fractional order differential equations. The main advantage of the propose approach is that it simplifies the problem alike in solving a system of fuzzy algebraic linear equation. An approximated error bound between the exact solution and the proposed fuzzy solution with respect to the number of fuzzy rules and solution errors is derived. Furthermore, we also discuss the convergence of the proposed method from the fuzzy perspective. Experimentally, we show the strength of the proposed method in solving a variety of FDE models under uncertainty encountered in engineering and physical phenomena (i.e viscoelasticity, oscillations and Resistor-Capacitor (RC) circuits). Comparisons are also made with solutions obtained by the Laguerre polynomials and fractional Euler method.

PDF (Coming Soon)

Older News Archive    [ click to Expand ]
Crowd Dataset

The dataset is used in our ICPR'14 paper.

Details ....
Malaya Abrupt Motion (MAMo) Dataset

The tracking dataset is used in our Info. Sci.'14 paper.

Details ....
Pratheepan Human Skin Detection Dataset

The dataset is used in our IEEE T-II'12 paper.

Details ....