Artificial Intelligence & Analytics - Data Science Manager

  • Blawenburg, NJ


: $115,815.00 - $176,770.00 /year *

Employment Type

: Full-Time


: Information Technology

Loading some great jobs for you...

Capgemini is expanding its footprintrapidly. As part of the fastest growing digitalpractice within Capgemini, we work with the latest advanced analytics, machine learning,and big data technologies to extract meaning and value from data in a number ofdifferent industries ranging from Media & Entertainment to Life Sciencesand everywhere in-between. Our team hasworked with geospatial data, on social media sentiment analysis, builtrecommendation systems, created image classification algorithms, solvedlarge-scale optimization problems, and harnessed the massive influx of datagenerated by the IoT.

The Data Science & Analyticsgroup is the fastest growing digital practice at Capgemini demanding agileinnovation. As part of the Data Science& Analytics group, you will work in a collaborative environment withinternal and client resources to understand key business goals, buildsolutions, and present findings to client executives while solving real-worldproblems. If you are passionate about solving problems in the realm ofcognitive computing, big data, and machine learning while utilizing businessacumen, statistical understanding, and technical know-how, the Data Science& Analytics practice group at Capgemini is the best place to grow yourcareer.

Role & Responsibilities:

Work with team to build out data science use case (PoC) related tostaffing and new assignments.

Leverage machine learning models and visualization tools.

Assess current state data science governance model andarchitecture and propose future state CoE model.

Leverage machine learning models and visualization tools.

Assess current state data science governance model andarchitecture and propose future state CoE model.

Work in collaborative environment with globalteams to drive client engagements in a broad range of industries: Aerospace & Defense, Automotive, Banking,Consumer Products & Retail, Financial Services, Healthcare, High Tech,Industrial Products, Insurance, Life Sciences, Manufacturing, Public Sector,Telecom, Media & Entertainment, and Energy & Utilities.

Quickly understand client needs, developsolutions, and articulate findings to client executives.

Provide data-driven recommendations to clientsby clearly articulating complex technical concepts through generation anddelivery of presentations.

Analyze and model both structured andunstructured data from a number of distributed client and publicly availablesources.

Perform EDA and feature engineering to bothinform the development of statistical models and generate improve modelperformance and flexibility.

Design and build scalable machine learningmodels to meet the needs of given client engagement.

Assist with the mentorship and development ofconsultants.

Assist in growing data science practice bymeeting business goals through client prospecting, responding to proposals,identifying and closing opportunities within identified client accounts.


3-5 years professional work experience as a datascientist or on advanced analytics / statistics projects.

Machine Learning

R/Python programming

Natural Language processing

Text analytics

Visualization platforms including Tableau and/or PowerBI

Ability to generate professional visualizations andreporting

Ability to interface with client SMEs

Ability to understand analytics business requirements and developcustom models and reporting

Experience in consulting environment a plus

Skills with Neo4J/Cyper

Masters degree from top tier college/universityin Computer Science, Statistics, Economics, Physics, Engineering, Mathematics,or other closely related field.

Excellent Pythonskills

  • Experience with entity matching , recordlinkage and data cleansing (probabilistic distance)

  • Experience with blocking methods

  • Experience with PySparkPhDpreferred.

Strong understanding and application ofstatistical methods and skills: distributions, experimental design, varianceanalysis, A/B testing, and regression.

Statistical emphasis on data mining techniques,Bayesian Networks Inference, CHAID, CART, association rule, linear andnon-linear regression, hierarchical mixed models/multi-level modeling, andability to answer questions about underlying algorithms and processes.

Experience with both Bayesian and frequentistmethodologies.

Mastery of statistical software, scriptinglanguages, and packages (e.g. R, Matlab, SAS, Python, Pearl, Scikit-learn,Caffe, SAP Predictive Analytics, KXEN, ect.).

Knowledge of or experience working with databasesystems (e.g. SQL, NoSQL, MongoDB, Postgres, ect.)

Experience working with big data distributedprogramming languages, and ecosystems (e.g. S3, EC2, Hadoop/MapReduce, Pig,Hive, Spark, SAP HANA ect.)

Expertise in machine learning algorithms andexperience using the following ML techniques: Logistic Regression, DecisionTrees, Random Forests, Gradient Boosting, SVMs, Time Series, KMeans,Clustering, NMF).

Preferred experience with NLP, Graph Theory, NeuralNetworks (RNNs/CNNs), sentiment analysis and Azure ML.

Experience building scalable data pipelines andwith data engineering/ feature engineering.

Preferred experience with web-scrapping.

Experience building and deploying predictivemodels.

Experience with PowerPoint and ability toclearly articulate findings and present solutions.

Excellent team-oriented and interpersonalskills.

Applicants for employment in the US must havevalid work authorization that does not now require sponsorship of a visa foremployment authorization in the US by Capgemini

(Includes Data Modeler, Data Miner.) Responsible for importing, cleaning, transforming, validating and modeling data with the purpose of understanding and drawing conclusions from data (may be presented in charts, graphs, and/or tables). Also, design and develop relational databases for collecting and storing data and build and design data input and data collection mechanisms.

Required Skills and Experience:

You are responsible for data related activities such as data extraction, profiling, cleansing, de-duplication, standardization, conversion, transformation and loading, data mining, warehousing, archiving and reporting. Responsible for all activities required to ensure optimum performance and data integrity of databases in production environments, in line with the requirements. Responsible for server based databases in development and test environments including database software installation, database creation, performance and capacity design, backup and recovery design, security design.

Qualifications: 3 7 years (2 years min relevant experience in the role), Bachelors Degree.

Should be proficient in Software Engineering Techniques, Software Engineering Architecture, Software Engineering Lifecycle and Data Management.

Should have progressing skills in Business Analysis, Business Knowledge, Software Engineering Leadership, Architecture Knowledge and Technical Solution Design.

Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.

This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.

Click the following link for more information on your rights as an Applicant -

About Capgemini

A global leader in consulting, technology services and digital transformation, Capgemini is at the forefront of innovation to address the entire breadth of clients opportunities in the evolving world of cloud, digital and platforms. Building on its strong 50 year heritage and deep industry-specific expertise, Capgemini enables organizations to realize their business ambitions through an array of services from strategy to operations. Capgemini is driven by the conviction that the business value of technology comes from and through people. It is a multicultural company of 200,000 team members in more than 40 countries. The Group reported 2018 global revenues of EUR 13.2 billion.

Visit us at . People matter, results count.



Title:Artificial Intelligence & Analytics - Data Science Manager


Requisition ID:043582

Associated topics: data analytic, data architect, data center, data engineer, data quality, data warehouse, database administrator, etl, mongo database, sybase

* The salary listed in the header is an estimate based on salary data for similar jobs in the same area. Salary or compensation data found in the job description is accurate.

Launch your career - Upload your resume now!

Upload your resume

Loading some great jobs for you...