Lambda architecture on Apache Spark, Apache Kafka for real-time
DSTK - DataScience ToolKit for All of Us
Supervised Ranking of Contigs in de novo Assemblies
Workflow Designer, Hive Editor, Pig Editor, File System Browser
Non-disjoint groupping of Documents based on word sequence approach