3-5 years professional work experience as a data scientist or on advanced analytics / statistics projects.
Natural Language Processing
Visualization platforms including Tableau and/or PowerBI
Ability to generate professional visualizations and reporting
Ability to interface with client SMEs
Ability to understand analytics business requirements and develop custom models and reporting
Experience in a consulting environment a plus
Skills with Neo4j/Cypher
Master's degree from a top-tier college/university in Computer Science, Statistics, Economics, Physics, Engineering, Mathematics, or other closely related field.
Excellent Python skills
Experience with entity matching, record linkage and data cleansing (probabilistic distance)
Experience with blocking methods
Experience with PySpark
PhD preferred.
Strong understanding and application of statistical methods and skills: distributions, experimental design, variance analysis, A/B testing, and regression.
Statistical emphasis on data mining techniques, Bayesian network inference, CHAID, CART, association rules, linear and non-linear regression, hierarchical mixed models/multi-level modeling, and ability to answer questions about underlying algorithms and processes.
Experience with both Bayesian and frequentist methodologies.
Mastery of statistical software, scripting languages, and packages (e.g. R, Matlab, SAS, Python, Perl, Scikit-learn, Caffe, SAP Predictive Analytics, KXEN, etc.).
Knowledge of or experience working with database systems (e.g. SQL, NoSQL, MongoDB, Postgres, etc.)
Experience working with big data distributed programming languages and ecosystems (e.g. S3, EC2, Hadoop/MapReduce, Pig, Hive, Spark, SAP HANA, etc.)
Expertise in machine learning algorithms and experience using the following ML techniques: Logistic Regression, Decision Trees, Random Forests, Gradient Boosting, SVMs, Time Series, KMeans, Clustering, NMF.
Preferred experience with NLP, Graph Theory, Neural Networks (RNNs/CNNs), sentiment analysis and Azure ML.
Experience building scalable data pipelines and with data engineering/feature engineering.
Preferred experience with web scraping.
Experience building and deploying predictive models.
Experience with PowerPoint and ability to clearly articulate findings and present solutions.
Excellent team-oriented and interpersonal skills.
Candidates should be flexible / willing to work across this delivery landscape, which includes, but is not limited to, Agile Applications Development, Support and Deployment.
Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by Capgemini.
(Includes Data Modeler, Data Miner.) Responsible for importing, cleaning, transforming, validating and modeling data with the purpose of understanding and drawing conclusions from data (may be presented in charts, graphs, and/or tables). Also, design and develop relational databases for collecting and storing data and build and design data input and data collection mechanisms.
Required Skills and Experience:
You are responsible for data related activities such as data extraction, profiling, cleansing, de-duplication, standardization, conversion, transformation and loading, data mining, warehousing, archiving and reporting. Responsible for all activities required to ensure optimum performance and data integrity of databases in production environments, in line with the requirements. Responsible for server based databases in development and test environments including database software installation, database creation, performance and capacity design, backup and recovery design, security design.
Qualifications: 3-9 years of experience, Bachelor's degree.
Should be proficient in Software Engineering Techniques, Software Engineering Architecture, Software Engineering Lifecycle and Data Management.
Should have progressing skills in Business Analysis, Business Knowledge, Software Engineering Leadership, Architecture Knowledge and Technical Solution Design.
Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.
This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.
Click the following link for more information on your rights as an Applicant -
A global leader in consulting, technology services and digital transformation, Capgemini is at the forefront of innovation to address the entire breadth of clients' opportunities in the evolving world of cloud, digital and platforms. Building on its strong 50-year heritage and deep industry-specific expertise, Capgemini enables organizations to realize their business ambitions through an array of services from strategy to operations. Capgemini is driven by the conviction that the business value of technology comes from and through people. It is a multicultural company of 200,000 team members in more than 40 countries. The Group reported 2018 global revenues of EUR 13.2 billion.
Visit us at www.capgemini.com. People matter, results count.
Organization: I AND D US
Title: Data Analyst Lead - Data Scientist with Spark