SMiLe CLiNiC research aims to develop flexible yet efficient probabilistic learning methods that can take into account diverse statistical features of real-world data. While machine learning methods have emerged as one of the most promising statistical frameworks for addressing real-world challenges, many of the current models and algorithms are too restrictive for capturing complex tasks, and are challenging for massive scale applications. In the latest study by IDC (J. Gantz and D. Reinsel. The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East.), it is forecasted that 40 ZettaBytes of digital content were created or replicated by 2020, and as much as 33% of the digital universe will contain information that might be valuable if analysed.
Complex statistical features are commonplace for those data; SMiLe CLiNiC focuses on the heterogeneity or multi-modality of the data. Phrased differently data that has textual, imaging, and network relationship representations and comes from multiple sources. Being in a real-world setting, the multi-modality challenge turns into a multi-faceted problem containing the following three major sub-problems: i) dynamic time evolution (data is collected over a long time period); ii) rich interdependency structures (in text, not only words but also how words combine into a sentence carry relevant information); iii) output inconsistencies (information mismatch between or within data sources). Scientific and societal challenges of the 21st century are to draw insights and make predictions to support knowledge creation from this massive amount of data.
SMiLe CLiNiC is part of the Sussex Informatics department, affiliated with Data Science @ Sussex and based in the Sussex campus set on the edge of the beautiful South Downs National Park. Key facts about Sussex: 1) Top 10 in the UK - 60th in the world for research influence (Times Higher Education, World University Rankings 2013-2014), 2) The sunniest part of UK, 3) Less than half an hour by cycle to the Brighton Beach, 4) The 6th safest university cities and towns.
Novi is smiling about nonparametric machine learning methods for big data.
Viktoriia is smiling about machine learning methods for computer vision.
Sri is approximately smiling about time series forecasting.
Pietro is smiling about scalable human-in-the-loop nonparametric Bayesian methods.
David is smiling about Twitter demographic profiling.
Joe is smiling about defining what privileged information is.
Tom is smiling about making Sussex Machine Learning Teaching Aid.
Rosemary is smiling about applying machine learning and statistical techniques to large biomedical datasets.
International Conference on Machine Learning, Helsinki, Finland, July 2008
International Conference on Machine Learning, Bellevue, Washington, US, June 2011
International Conference on Machine Learning, Edinburgh, Scotland, UK, June 2012