Research scientist in the field of data Management / big data / bioinformatics

 StepStone ·  25.10.2018 ·  Jülich ·  Forschungszentrum Jülich GmbH

As a member of the Helmholtz Association, Forschungszentrum Jülich makes an effective contribution to solving major challenges facing society in the fields of information, energy, and bioeconomy. It focuses on various tasks in the area of research management and utilizes large, often unique, scientific infrastructure. Come and work with around 5,900 colleagues across a range of topics and disciplines at one of Europe's largest research centres.

The Institute of Bio- and Geosciences – Plant Sciences (IBG-2) develops integrated bioeconomic concepts for the sustainable intensification of crop production. The aim is to improve yield, adjust quality to various forms of use (groceries, animal feed, raw materials, bioenergy), close nutrient cycles, and adapt processes to future climate and production conditions. IBG-2 is a world leader in the field of plant phenotyping, driven by its outstanding expertise in explaining the dynamic interaction between plants and the environment, combined with technological development, engineering and bioinformatics. Within this context, the institute develops and operates relevant next-generation infrastructure platforms and technologies and makes them available to external users (DPPN, EPPN2020, IPPN, EMPHASIS).

Infrastructure platforms for plant phenotyping produce large volumes of data, which require sound management to ensure they are available, accessible, interoperable and reusable for the scientific community. Within the scope of the EPPN 2020, Emphasis, and EOSC-Life projects, IBG-2 is working together with international partners on a pan-European e-infrastructure and is developing data management strategies, tools and demonstrators.

We are looking to recruit a

Research scientist in the field of data Management / big data / bioinformatics

The aims of the position are: (i) to structure and standardize existing data to enable interoperability; (ii) to prepare metadata in accordance with the FAIR principles; and (iii) to develop software that is capable of handling large volumes of scientific data, adapting itself to evolving data models, and enabling useful access in the form of a standard application programming interface (API) and a graphical user interface (GUI). Furthermore, a key function of the position is to develop and define data models, which will be aggregated in ontologies. With the aid of automatic code generators, the aforementioned standard API and GUI should then be automatically generated for such a set of data models. Data storage must be capable of integrating various technologies and supporting distribution.

Your main task will be the coordination and strategic scientific development of software tools (code generators) and frameworks for the generation and provision of installations of large data integration platforms. You will also interact with an interdisciplinary group of scientists that generales diverse very different data or uses this data data for their research activities. An important aspect of the position is the communication and, above all, the information and data flow between various employees in these projects and additional users within the European plant phenotyping community. You will need to understand the structure of the obtained data and identify what parts of the data are used by employees in order to distinguish between raw data and data to be processed. In addition, the raw data must be sufficiently secured to enable future analysis using innovative methods; the data to be processed must be structured and presented in appropriate data models. The exchange and integration of data is crucial to enable meta-analyses and collaboration between various research groups and disciplines. You should therefore have sound knowledge of web development technology and, in particular, data storage frameworks. Team leadership skills, communication skills and project management expertise are essential for overseeing the efficient development of software tools that automatically generate standardized API and GUI access to a range of data models.

university degree, preferably a doctoral degree, in informatics or the natural sciences with relevant and proven experience in the field of work

sound knowledge of and experience in scientific issues related to big data

experience in team and project management, preferably in an international context

very good experience in the design, development and implementation of requirement-oriented, complex web applications, database applications and storage technologies using conventional programming languages and tools on various platforms; preferably in a biological or scientific context

knowledge of and experience in distributed computing and/or the parallelization of applications (workload managers, Pachyderm framework)

experience in and knowledge of Node.js web development and at least one single-page application JavaScript framework (Vue.js, React or AngularJS)

experience in an additional programming language (R, Scala, Python, C/C++, Fortran)

knowledge of open API/Swagger stack

experience in Scrum/agile and test-driven development

knowledge of and experience in various data storage technologies: relational databases (Postgres), document-based NoSQL databases (MongoDB, Cassandra, HBase), in-memory key-value stores (REDIS), large data storage devices and related technologies (Apache Hadoop, Hive, Presto, HDF5)

knowledge of and preferably experience in distributed computing (MapReduce, Apache Spark and particularly Pachyderm)

preferably experience in Vim and Emacs

knowledge of statistics

excellent collaboration and communication skills

good command of English

an interdisciplinary working environment at a leading scientific institution

the opportunity to become part of the national and international scientific community

exciting working environment on an attractive research campus, ideally situated between the cities of Cologne, Düsseldorf and Aachen

a comprehensive further training programme for professional development

flexible working hours and various opportunities to reconcile work and family life

employment initially for a fixed term of three years but with long-term prospects

full-time position with the option of slightly reduced working hours

Salary and social benefits in conformity with the provisions of the Collective Agreement for the Civil Service (TVöD)

For more information, please visit our website (

Forschungszentrum Jülich aims to employ more women in this area and therefore particularly welcomes applications from women.

We also welcome applications from disabled persons. — ein Projekt von DIPLANET Digital Solutions Impressum