Job Description: - This is a Big Data Administrator Lead position and not a developer position.
- The Lead Data Engineer is responsible for orchestrating, deploying, maintaining and scaling cloud OR on-premise
- infrastructure targeting big data and platform data management (e.g., data warehouses, data lakes) including data access APIs.
- Prepares and manipulates data using Hadoop or equivalent.) with emphasis on high availability, reliability, automation and performance.
- This role will focus on leading the migration and set up of the Enterprise Data Platform on Cloud using a combination of Cloudera CDP public cloud and other AWS services.
- Advanced (expert preferred) level experience in administrating and engineering relational databases (ex. MySQL, PostgreSQL), Big Data systems (ex. Cloudera Data Platform Private Cloud and Public Cloud), Apache Solr as SME, ETL (ex. Ab Initio), BI (ex. MicroStrategy), automation tools (ex. Ansible, Terraform, Bit Bucket) and experience working cloud solutions (specifically data products on AWS) are necessary.
- At least 10 years of Experienced with all the tasks involved in administration of big data and Meta Data Hub such as Cloudera. Solr experience is a MUST.
- Experience with Ab Initio, EMR, S3, Dynamo DB, Mongo DB, ProgreSQL, RDS, DB2 is a Plus.
- DevOps (CI/CD Pipeline) is a Plus.
- Experience with Advance knowledge of UNIX and SQL.
- Experience with manage metadata hub-MDH, Operational Console and troubleshoot environmental issues which affect these components.
- Require prior experience with migration from on-premise to AWS Cloud.
- Represents team in all architectural and design discussions.
- Knowledgeable in the end-to-end process and able to act as an SME providing credible feedback and input in all impacted areas.
- Require tracking and monitoring projects and tasks as the lead.
Essential Function: - Weight Essential Functions.
- 20% Represents team in all architectural and design discussions.
- Knowledgeable in the end-to-end process and able to act as an SME providing credible feedback and input in all impacted areas.
- Require project tracking and task monitoring. the lead position ensures an overall successful implementation especially where team members all are working on multiple efforts at the same time.
- Lead the team to design, configure, implement, monitor, and manage all aspects of Data Integration Framework.
- Defines and develop the Data Integration best practices for the data management environment of optimal performance and reliability.
- Plan, develop and lead administrators with project and efforts, achieve milestones and objectives.
- Oversees the delivery of engineering data initiatives and projects including hands on with install, configure, automation script, and deploy.
- 20% Develops and maintains infrastructure systems (e.g., data warehouses, data lakes) including data access APIs.
- Prepares and manipulates data using Hadoop or equivalent MapReduce platform.
- 15% Develop and implement techniques to prevent system problems, troubleshoots incidents to recover services, and support the root cause analysis.
- Develops and follows standard operating procedures (SOPs) for common tasks to ensure quality of service.
- 15% Manages customer and stakeholder needs, generates and develops requirements, and performs functional analysis.
- Fulfills business objectives by collaborating with network staff to ensure reliable software and systems.
- Enforces the implementation of best practices for data auditing, scalability, reliability, high availability and application performance.
- Develop and apply data extraction, transformation and loading techniques in order to connect large data sets from a variety of sources.
- 15% Acts as a mentor for junior and senior team members.
- 10% Installs, tunes, upgrades, troubleshoots, and maintains all computer systems relevant to the supported applications including all necessary tasks to perform operating system administration, user account management, disaster recovery strategy and networking configuration.
- 5% Expands engineering job knowledge and leading technologies by reviewing professional publications; establishing personal networks; benchmarking state-of-the-art practices; educational opportunities and participating in professional societies.
Problem Complexity and Problem Solving Timeframes: - Works on significant and unique issues where analysis of situations or data requires and evaluation of intangibles.
- Aware and responds to changing and interconnected variables.
- Exercises independent judgment in methods, techniques and evaluation criteria for obtaining results.
- Problem/Task resolution timeframe: Inclusive of shorter timeframes, but typically twelve months or more to resolve.
Level of Supervision Received: - Establishes personal standards of performance within broad framework of policy and objectives as set by senior management.
Impact: - Erroneous decisions or recommendations would normally result in failure to reach goals crucial to significant organizational objectives and would profoundly effect the image of the organization.
Contact with Others: - Acts as prime consultant on significant tasks that affect the organization's long-term goals and objectives.
- Interacts with senior management and senior value-chain partners both internally and externally on matters requiring coordination and decision-making across organizational lines.
Qualification: - To perform this job successfully, an individual must be able to perform each essential duty satisfactorily.
- The requirements listed below are representative of the knowledge, skill, and ability required.
- Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Education: - Bachelor's Degree in Information Technology, Computer Science or related field.
Experience: - 8 years of relevant engineering experience.
- In Lieu of Education.
- In lieu of a Master's degree, an additional 4 years of relevant work experience is required in addition to the required work experience.
Knowledge, Skills and Abilities: - Knowledge of programming languages and web based technologies.
- Experience with Cloudera CDP on-prem and public cloud; Solr SME.
- Ability to collaborate to solve technical problems across teams.
- Excellent communication skills both written and verbal.
•
Last updated on Jun 15, 2023