Statistician, Data Sciences

Janssen Pharmaceuticals is currently recruiting for a Statistician, Data Sciences. This position can be located in Raritan, NJ; Titusville, NJ; Spring House, PA; or Raleigh, NC. Occasional travel to the NJ/PA sites will be required. Additional travel up to 15% may be required.


Janssen Research & Development, L.L.C. develops treatments that improve the health and lifestyles of people worldwide. Research and development areas encompass novel targets in neurologic disorders, gastroenterology, oncology, infectious disease, diabetes, hematology, metabolic disorders, immunologic disorders, and reproductive medicine. We have produced and marketed many first-in-class prescription medications and are poised to serve the broad needs of the healthcare market, from patients to practitioners and from clinics to hospitals. Janssen Pharmaceuticals, Inc. is one of the Pharmaceutical Companies of Johnson & Johnson.


We thrive on a diverse company culture, celebrate the uniqueness of our employees, and are committed to inclusion. We are proud to be an equal opportunity employer.
Janssen, one of the Pharmaceutical Companies of Johnson & Johnson, has access to and generates a wealth of data. The core mission of the Data Sciences group is to leverage this data using machine learning, pattern detection, anomaly detection, and predictive modeling to create insights and impact health care.

The Data Sciences group within the Pharma IT department of Janssen is looking for an outstanding Statistician who is interested in leading, designing, and developing solutions to measure privacy risk and implement de-identification and anonymization methods. The candidate for this position will work with a newly formed team and product line for Privacy and De-identification Analytics. The role requires broad knowledge of existing data mining solutions, as well as the leadership and creativity to invent and customize approaches and to work in a multidisciplinary environment to drive business solutions.
The Statistician, Data Sciences will be part of a dynamic, accomplished informatics team that will support a broad portfolio of Data Scientists across the Enterprise. He/She is accountable for executing business requirements for projects, achieving project objectives, and ensuring effective integration of new clinical data.

  • Serve as a technical leader of the project team in the field of Privacy and De-identification Analytics, with specific expertise in the following areas: statistics, de-identification, risk assessment, and the structure and content of clinical data sets.
  • Categorize identifiers and quasi-identifiers, as well as develop the methods to accurately assess re-identification risk and alter data elements to reduce that risk.
  • Work in collaboration with the business team and/or Business Relationship Manager to understand business demands and identify opportunities.
  • Participate in the development of software tools, applications, and analyses to support the need to ensure patient privacy while maintaining data utility.
Additional decision-making responsibilities:
  • Address complex problems with broad implications for research data, balancing the often competing needs of privacy and utility.
  • Ensure solutions are consistent with business objectives or business strategy.
  • Make decisions regarding resource alignment/dedication and prioritization (people resources, dollars/funding, project criticality) and communicate rationale back to the business and project management.
  • Contribute to the strategic planning process and long-term direction for the Privacy and De-identification team and align plans between stakeholders.

  • A Bachelor’s degree in Statistics or related discipline with a minimum of 3 years of relevant experience OR an advanced degree (Master’s or PhD) in Statistics or related discipline.
  • Familiarity with tools and methods to calculate re-identification risk in clinical data is preferred. 
  • Experience building population statistics to compare to the prevalence of elements in the analysis dataset is preferred. 
  • Experience working with complex, multidisciplinary teams in a matrix environment is preferred.
  • Strong data modeling, data mapping, data manipulation, and schema design experience, along with working knowledge of relational database modeling concepts and SQL or an equivalent language such as R, SAS, or Python, is required.
  • Experience using data profiling and data quality tools, experience with ETL activities, and knowledge of common data models (e.g. OMOP, i2b2) are preferred.
  • Understanding of clinical data models, standards, and vocabularies such as MedDRA, SNOMED, CDISC, CDM, and RxNorm is preferred.
  • Experience working in a data warehouse environment with exposure to Informatica, Redshift, and Teradata. Knowledge of health outcomes databases (claims, EMR/EHR, survey, observational studies) is preferred.
  • Strong expertise in health informatics, including familiarity with health outcomes databases and clinical trial registries and an understanding of IT resource and cost drivers for a franchise or business unit, is preferred.
  • Excellent interpersonal skills and the ability to drive tasks in a diverse team are required.

Primary Location
United States-Pennsylvania-Spring House
Other Locations
North America-United States-North Carolina-Raleigh, North America-United States-New Jersey-Titusville, North America-United States-New Jersey-Raritan
Janssen Research & Development, LLC. (6084)
Job Function
Info Technology
Requisition ID
