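Docker in Bioinformatics
Building an Ecosystem for Genomic Data Applications

Author: Huang Zehui (黄泽辉)
Title: Director of Internet Products, BGI (华大基因)

Contents
I. Genomics data and analysis applications
II. Problems and challenges
III. Docker and application standardization
IV. How to build a genomic "App Store"

I. Genomics data and analysis applications

[Diagram: Expression, Structure, Phenotype, Sequence, Interactions]

II. Problems and challenges

http:/ ... of software and hardware environments, from workstations to cluster, cloud, and high performance computing (HPC) environments.
- version control

III. Docker and application standardization

1. Use preconfigured Docker images to provide the runtime environment
- Lightweight
- Simple and easy to use
- Stable and reproducible
- Independent of programming language and operating system
- Version controlled

2. A unified CLI (execution) description language
- Tool wrapper / tool specification
- Linked Data
- POSIX compatible

3. A unified workflow (DAG) description language
- Build workflows from tools or sub-workflows
- Define task dependencies
- Data streaming
- Task parallelism
- Dynamic job generation

4. Provide a complete toolchain (support tooling)
- Tool development (author)
- Debugging (debug)
- Execution (scheduler)

5. Distribute software with Dockerfile & Docker Registry
- Simple distribution and use
- Build images from a Dockerfile

Dockstore
A tool consists of:
- a (currently Docker) image
- a document that describes how to use that image (currently CWL or WDL)
- a Dockerfile that describes how to reproduce the image in the future

IV. How to build a genomic "App Store"

[Diagram: User, Developer, Dockerfile, Tool.cwl, Build System, Docker Image, App Store, workload, JSON parameterization file]

GA4GH-compliant workflow execution service
- Eclipse Che: online development tool
- Visual editor
- Tool registry API
- Workflow execution API

Workflow execution API
- ability to request a workflow run using CWL or WDL (and maybe future formats)
- ability to parameterize that workflow using a JSON schema that's simple and used in common between CWL and WDL
- ability to get information about running workflows, status, errors, output file locations, etc.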
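
The workflow execution API capabilities above (request a run in CWL or WDL, parameterize it with a shared JSON document, query run status and outputs) can be illustrated with a minimal Python sketch. Everything named here is an assumption for illustration only: the field names (`workflow_url`, `workflow_type`, `workflow_params`), the helper functions, the file names, and the status record layout are hypothetical and are not taken from the slides or from any specific service.

```python
import json

# Workflow formats named in the slides; future formats could be added here.
SUPPORTED_TYPES = {"CWL", "WDL"}


def build_run_request(workflow_url, workflow_type, params):
    """Assemble a hypothetical workflow-run request body.

    The same JSON parameter document is used whether the workflow
    is written in CWL or WDL.
    """
    if workflow_type not in SUPPORTED_TYPES:
        raise ValueError(f"unsupported workflow type: {workflow_type}")
    # Round-trip through JSON so non-serializable parameters fail early.
    params_json = json.dumps(params, sort_keys=True)
    return {
        "workflow_url": workflow_url,
        "workflow_type": workflow_type,
        "workflow_params": json.loads(params_json),
    }


def summarize_run(run):
    """Condense a hypothetical run-status record into one line."""
    outputs = ", ".join(sorted(run.get("outputs", {}).values())) or "none"
    return f"{run['run_id']}: {run['state']} (outputs: {outputs})"


# Hypothetical JSON parameterization file content for a tool described by tool.cwl.
request = build_run_request(
    "tool.cwl",
    "CWL",
    {"input_fastq": {"class": "File", "path": "sample.fastq"}, "threads": 4},
)
print(json.dumps(request, indent=2))

# Hypothetical status record, as a service might return it for a finished run.
status = {
    "run_id": "run-001",
    "state": "COMPLETE",
    "outputs": {"bam": "out/sample.bam"},
}
print(summarize_run(status))
```

Keeping the parameters as plain JSON, separate from the workflow description, is what lets the same request shape serve both CWL and WDL; a real service would define its own schema for these fields.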