- [Nov 2016] Two journal papers on two core aspects of Archimedes Probabilistic Knowledge Base System were published in the VLDB Journal: 1) Inference: In-Database Batch and Query-time Inference over Probabilistic Graphical Models using UDA-GIST, by Kun Li*, Xiaofeng Zhou*, Daisy Zhe Wang, Christan Grant, Alin Dobra, Christopher Dudley and 2) Learning: ScaLeKB: Scalable Learning and Inference over Large Knowledge Bases, by Yang Chen, Daisy Zhe Wang, Sean Goldberg.
- [Sept 2016] NIST sponsors UF faculty Prof. Daisy Zhe Wang and Prof. Ethan White as PIs for developing a new Data Science Evaluation track for Fall 2017 on Data Science for Plant Identification with Remote Sensing data from National Ecological Observatory Network (Neon).
- [August 2016] Two system demo papers 1) ArchimedesOne: Query Processing over Probabilistic Knowledge Bases, by Xiaofeng Zhou, Yang Chen, Daisy Zhe Wang and 2) SigmaKB: Multiple Uncertain Knowledge Base Fusion, by Miguel E. Rodríguez, Sean Goldberg, Daisy Zhe Wang are presented at the 2016 VLDB conference in New Delhi, India.
- [July 2016] Our work on Multimodal Ensemble Fusion for Disambiguation and Retrieval by Yang Peng et. al. has been accepted as a journal paper to the IEEE Multimedia Magazine.
- [May 2016] Prof Daisy Zhe Wang visited Computer Science at the University of Washington to give a talk as part of the North West Database Society series of talks on Archimedes: A Probabilistic Master Knowledge Base System.
- [March 2016] UF DSR Lab is invited to participate the NIST Data Science pre-pilot evaluation workshop 2016 and will be presenting (1) the results of the 2015 NIST Data Science pre-pilot evaluation participation from UF and (2) a proposal of a new Data Science evaluation on Computational Ecology using remote sensing and data from the NSF Neon program.
- [March 2016] Consensus Maximization Fusion of Probabilistic Information Extractors by Miguel Rodriguez et. al is accepted at HTL NAACL 2016. This CMF algorithm participated in the TAC KBP SVF evaluation organized by NIST in 2015 and achieved top 3 ranked results in CSSF/CSKB and overall ensemble runs.
- [Feb 2016] Prof. Daisy Zhe Wang visited Computer Science at University of Miami, Information Sciences Institute at University of South California and gave talks on different aspects of Archimedes. I also visited UC Irvine to discuss research projects.
- [Jan 2016] The NLP expertise in the UF DSR lab was drawn upon by UF CTSI and supporting the newly funded OneFlorida Clinical Research Consortium, which was recently designated as one of the nation’s 13 clinical data research networks, or CDRNs, by the Patient-Centered Outcomes Research Institute (PCORI) to accelerate the translation of promising research findings into improved patient care.
- [Spring 2016] Prof. Daisy Zhe Wang is advising four student projects in CAP4773/CAP6779 Project in Data Science: (1) contributing to Apache MADlib; (2) Legal citation graph analytics and Case predictions; (3) automatically extracting biomedical knowledge bases; and (4) distributed RDF store for query processing over large knowledge bases.
- [Nov 2015] Ontological Pathfinding: Mining First-Order Knowledge from Large Knowledge Bases by Yang Chen et. al is accepted at SIGMOD 2016.
- [October, 2015] The MADlib, open-source library for scalable in-database analytics, is now an Apache Software Foundation Incubator project: MADlib@ASF. We are excited to continue our contribution!
- [September, 2015] Our research on “Efficient Query Processing over Large Probabilistic Knowledge Bases” is funded for 3 years by NSF IIS Div. of Information & Intelligent Systems.
- [August, 2015] As par of the University of Florida Engineering team, Prof. Daisy Zhe Wang visited the Harris Corporation, presented and discussed past and future research and development projects at the Harris Technology Center.
- [August, 2015] Congratulations to Dr. Christan Grant on successfully defended his Ph.D. thesis on “Query-Driven Text Analytics for Knowledge Extraction, Resolution, and Inference“. Best of luck starting the Assistant Professorship at the University of Oklahoma in Data Science!
- [May, 2015] Congratulations to Dr. Kun Li on successfully defending his Ph.D. thesis on “In-Database Large-Scale Statistical Data Analysis“. Best of luck heading over to Google!
- [April, 2015] Prof. Daisy Zhe Wang visited Bay Area and gave talks at UC Berkeley AMP Lab Seminar and Google Research on “Archimedes: A Probabilistic Master Knowledge Base System”.
- [Feb, 2015] Harris Corporation provided a seed fund to UF DSR group to conduct a Research Excellence Endowment Project, in which Archimedes is the targeted smart big data engine to be implemented over the Gator SmartCloud. This is a collaboration with Prof. Xiaolin (Andy) Li from ECE.
- [Jan, 2015] Sean Goldberg has been selected as a new Sandia Campus Exec Fellow at UF from 2015 to 2017. Data Science in one of the recently identified Sandia Research Challenges. Congratulations to Sean!
- [Nov, 2014] Our paper UDA-GIST: An In-database Framework to Unify Data-Parallel and State-Parallel Analytics is accepted at VLDB, 2015.
- [Sept, 2014] We published an IEEE Bulletin journal paper describing the big picture of our on-going efforts in extending databases to support Efficient In-Database Analytics with Graphical Models.
- [June, 2014] Our paper Knowledge Expansion over Probabilistic Knowledge Bases is presented at The ACM SIGMOD International Conference on Management of Data, 2014.
- [Apr 16, 2014] Our paper Exploring Netflow Data using Hadoop is accepted by The Third ASE International Conference on Cyber Security, 2014.
- [Jan 6, 2014] Three-Course Data Science Curriculum @ UF CISE starts Spring 2014 with a first course in the series — Introduction to Data Science. For more information, please refer to the Dec 2013 blog post on this.
- [Sep 4, 2013] Our work on Crowd-Assisted Text Labeling and Extraction is accepted by the First AAAI Conference on Human Computation and Crowdsourcing (HCOMP-2013).
- [Aug 12, 2013] Our work on Knowledge Expansion using Inference over large-scale Uncertain Knowledge Bases in the ProbKB project received Google Faculty Research Award.
- [Jun 15, 2013] Our papers GPText: Greenplum Parallel Statistical Text Analysis Framework, Web-Scale Knowledge Inference Using Markov Logic Networks, and Knowledge Extraction and Knowledge Extraction and Outcome Prediction using Medical Notes are accepted by ICML SLG, ICML WHealth and SIGMOD DanaC workshops.
- [April 22, 2013] The First Data Science Exposition was successfully held in conjunction with UF CISE Computer Science Day. Four groups received prizes sponsored by Google. The course computing infrastructure is sponsored by Amazon Web Services (AWS).
- [Feb 22, 2013] Data science research group extravaganza in honor of Dr. Pedro Domingo‘s visit.
- [Jan 14, 2013] Our group has established collaboration with ICAIR and the UF Law School in the E-Discovery Project to improve legal review process using machine learning techniques. For details on the E-discovery project, please refer to:http://www.law.ufl.edu/academics/institutes/icair.
- [Nov 6, 2012] As part of the CUBISM (IHMC/UA/UF), we participated in the DARPA DEFT kick-off meeting in Tampa. For details of the CUBISM/DEFT, please refer to:http://www.ihmc.us/news/20121101.php
- Dr. Bonnie Dorr, IHMC research group visited us; we had a fruitful discussion on the frontiers of NLP, knowledge bases and conversation systems.
- [Jul 16, 2012] Our paper MADden: Query-Driven Statistical Text Analytics has been accepted by the CIKM 2012 demo session.
- [Jun 7, 2012] We gave a talk on our poster Automatic Knowledge Base Construction using Probabilistic Extraction, Deductive Reasoning, and Human Feedback in AKBC-WEKEX 2012, Montreal, Canada.
- Our paper The MADlib Analytics Library or MAD Skills, the SQL was accepted byVLDB 2012.
- [Apr 24, 2012] Our paper Automatic Knowledge Base Construction Using Probabilistic Extraction, Deductive Reasoning, and Human Feedback was accepted as a poster by AKBC-WEKEX 2012.