The Taub institute and Sergievsky center seeks a Scientific Database Analyst to work on several federally funded multi-center genetics and multi-omics studies. The incumbent reports to the principal investigator and the project manager. Primary responsibilities include managing the storage, querying and linking of genetics and clinical data. Adhere to university policies, technical standards and best practices for operational excellence.
Managing the storage and querying various genetics data (genome-wide chip data, whole exome and whole genome sequencing data).
Linking genetic to Clinical data and creating anonymized datasets for investigators
Test programs or databases, correct errors, and make necessary modifications.
Specify users and user access levels for each segment of database.
Write and code logical and physical database descriptions and specify identifiers of database to management system or direct others in coding descriptions.
Work as part of a project team to coordinate database development and determine project scope and limitations.
Review procedures in database system manuals for making changes to database.
Review data requests with project manager to estimate time required to accomplish project.
QC data requests and work with project manager before distributing to investigators and or analysts.
Maintain database system for various genetic projects and provide feedback to database users.
Assist with development and implementation of tools for genetics analysis.
Perform quality control, assembly, annotation and analysis of genetic subtyping data.
Fulfill required IRB compliance trainings.
Work with analysts and other subject matter experts in the merging and QC of incoming and outgoing data.
Prepare reports and data files as requested
Manage and QC data of the various genetic projects.
Requires a bachelor's degree or equivalent in education, training and experience, plus three years of related experience. Master's degree preferred.
Experience working with Clinical and genomic data.
Experience maintaining data using MS Excel, SPSS, and Progeny.
Experience with MS Excel VBA, Shell Script, AWK.
Experience with Scripting in Python/Perl
Familiarity with sequenced and multi-omics data.
Ability to work independently and yet to maintain the flexibility to respond to requests for support of ongoing projects
2 years of experience in database management.
Experience in SQL and MS Excel
Equal Opportunity Employer / Disability / Veteran
Columbia University is committed to the hiring of qualified local residents.
Internal Number: 506515
About Columbia University
Columbia University is one of the world's most important centers of research and at the same time a distinctive and distinguished learning environment for undergraduates and graduate students in many scholarly and professional fields. The University recognizes the importance of its location in New York City and seeks to link its research and teaching to the vast resources of a great metropolis. It seeks to attract a diverse and international faculty and student body, to support research and teaching on global issues, and to create academic relationships with many countries and regions. It expects all areas of the university to advance knowledge and learning at the highest level and to convey the products of its efforts to the world.