5+ years of experience with NoSQL databases like MongoDB, ElasticSearch, and Hadoop., Strong scripting skills in Python and Shell, with knowledge of automation tools like Ansible and Terraform., Experience with cloud databases in environments like AWS, Azure, and GCP., Excellent communication and collaboration skills, with a strong understanding of database technologies..
Key responsibilities:
Administer and maintain MongoDB, Elastic, and Hadoop clusters, including performance analysis and capacity planning.
Coordinate with application teams for database capacity planning and assist in resolving performance bottlenecks.
Develop automated solutions for script execution and report generation, and manage database upgrades.
Act as a technical point of contact for Hadoop applications, ensuring optimized solutions for new and existing applications.
Report This Job
Help us maintain the quality of our job listings. If you find any issues with this job post, please let us know.
Select the reason you're reporting this job:
CatchProbe is the world's leading international technology company that provides actionable Web Intelligence, OSINT, Deception Systems, Threat Intelligence and Digital Crime Analytics solutions and products to many organizations with the world's first AI-Driven SaaS-based centralized and autonomous intelligence platform.(IntelligencySuite, version IDA Mountain)
CatchProbe is an enhanced centralized actionable intelligence orchestration platform for intelligence gathering, enrichment, deception, profiling or create a targeting, semantics and attribution.
It collects all the intelligence needed to prevent potential threats, enriches, correlates, reports and prevention the intelligence it has obtained for accurate analysis.
In addition to verified threat intelligence data collected from open, private and owned sources, deception systems (AI-Based Autonomous Deceptions), leaked data (Leak DBs) and Dark/Deep Web resources are also used by CatchProbe and accessed through the centralized interface.
MAKES A DIFFERENCE WITH CATCHPROBE MODULES Enriching and making sense of the collected intelligence data is among the priority objectives. In this way, a wide and detailed research and analysis platform are created for analysts. Additional modules have been developed to expand the intelligence to be collected and to make multiple resources available to customers. Analyzing all the collected data thanks to the analytics platform and reaching meaningful results are among the benefits provided.
INTELLIGENCE GATHERING IS EASY WITH CATCHPROBE
Experience the ease of intervention and flexibility of actionable intelligence as a result of the power of artificial intelligence and analytical insights, with your integrations with infrastructure and systems, while autonomously managing escalation and delegation between authorized analysts. Automatically profile and detect your attackers.
Workplace Type : Remote - Region : San Francisco, CA
Job Description
Must have experience with MongoDB installations, upgrades, support on MongoDB
Responsible for administration, maintenance, Performance analysis, and Capacity planning for Mongo dB/Elastic/Hadoop clusters.
Coordinate and plan with Application teams on Mongodb capacity planning for new applications.
Should have knowledge of using mongo base tools - like mongodump , mongoexport ,mongorestore, mongoimport , mongostat , mongotop
Must be well-versed with JSON scripting , writing queries in mongo in shell scripts and in the mongo shell
Should be able to support sharded clusters and perform upgrades and other config maintenance on sharded clusters
Must be able to address , monitor and manage capacity requirements - all aspects CPU , memory and storage .
Must be able to assist application teams with assessment and/or resolution of performance bottlenecks observed in the MongoDB Database tier of their stack
Must be aware of different authentication/authorization methods used in Mongo - SCRAM-SHA1 , X509 , LDAP - including reconfiguration of instances with TLS/SSL and/or LDAP
Candidate must also be able to develop automated solutions for ad-hoc script execution requests, ad-hoc report generation, upgrades , installs
Experienced in NoSQL DB technologies
Must be aware of how to use Cloud manager and share relevant metrics for a given deployment when an issue arises
Must have experience with docker, deploying Mongo containers running on docker and supporting all aspects of MongoDB administration needs within a docker container
Knowledge of administration and support of Hadoop systems will be an added advance.
Deploy Hadoop(Bigdata) cluster, Comm/decommissioning of nodes, track jobs, monitor services like zookeeper, hbase, SOLR indexing, configure name-node HA, schedule, configuring backups & restore
To develop script to review logs and alert us in case of long running queries.
Demonstrable expertise (deployment, use) in some of the following data storage technologies: Postgres/MySQL (“on-prem”, RDS, Aurora), Kafka / Kinesis (“on-prem” or managed),
Strong scripting experience with Python (preferred), and Shell (secondary).
Individually build services, and expose internal APIs for these services that allow other teams, and workflows to use data infrastructure automation components.
Required Skills
Strong Understanding of various relational and non-relational database technologies along with their benefits, downsides and best use-case and help application teams to use the correct database technology based on their specific business use case.
5+ year installing, automating, scaling and supporting NoSQL databases such as MongoDB, ElasticSearch, Hadoop, among other emerging technologies.
1-2 years Experience working with the databases in public Clouds like Hetzner, AWS, Azure and GCP.
Proficiency in automation
Knowledge of Ansible, Python, Terraform
Willingness and commitment to learn other database, automation, and Cloud technologies
Great communication and collaboration skills
Software development experience and knowledge of modern software development processes.
Knowledge/experience in development programming languages such as Java, Go, Nodejs, html, CSS, bootstrap etc. is a plus.
Knowledge/experience on AI/ML is a big plus.
Ability to multi-task and prioritize with little to no supervision, providing team leadership skills.
Ability to work well under pressure
Consistent exercise of independent judgment and discretion in matters of significance
Excellent communication skills
Highly driven, highly involved, highly proactive
Datalake cluster ownership and Technical point of contact for all applications on Hadoop cluster
Responsible for new application on-boarding in Datalake by reviewing requirement and design.
Assist existing and new applications to come up with most optimized and suitable solutions for their requirement
L3 point of contact for issues related to Hadoop platform
Core Responsibilities
Develops solutions for very complex and wide reaching systems engineering problems. Sets new policies and procedures to handle future issues. Creates systems engineering and architectural documentation to be used by others to build and maintain systems.
Operating Systems & Disk Management: Provides in-depth knowledge of Operating System internals to aid in solving complex problems. Acts as an authority on at least one supported Operating System. Mentors and trains more junior team members on Operating System concepts, configuration, tuning and troubleshooting techniques. Creates basic task automation scripts in Shell, Perl, VBscript, or similar. May manage servers remotely in a distributed environment.
Database Platform Management: Masters understanding of database concepts, availability, performance, usage and configuration. Sets up, fixes and tunes complex standard and non-standard replication. Uses knowledge of existing database platforms to evaluate and recommend new technologies. Uses database knowledge to solve issues on unfamiliar products. Creates and maintains database policies, standards and overall documentation including availability, replication, availability and backup and recovery policy, service level agreement, baseline architecture, change management, access to production, unsupported HW/SW, security & audit violations and risk acceptance.
Storage and Backup: Sets up, solves problems and tunes complex SAN software issues. Uses knowledge of existing storage platforms to evaluate and recommend new technologies. Uses storage knowledge to solve issues on unfamiliar products. Creates and maintain policies, standards and overall documentation including availability and backup & recovery, service level agreement, baseline architecture, change management, access to production, unsupported HW/SW, security & audit violations and risk acceptance.
Scripting and Development: Develops software in several modern languages. Develops database-backed systems and has a solid grasp of DB schema and query performance. Designs horizontally-scalable solutions with innovative use of storage and networking, including solid APIs for integration with other systems. Applies professional standard methodologies in day-to-day work like revision control, unit testing, or other. Applies statistical data analysis techniques.
Networking: Recommends or helps architect an entire system, including network design and topology. Authority in understanding and performing TCP dumps, snoop and other network sniffers. Understands and applies knowledge of most protocols (TCP/IP, HTTP, UDP etc.)
Application Technologies (Web Servers, J2EE, Applications Servers): Provides authoritative recommendations and advice to the team and/or department in the areas of web services, OS and storage, including being an active liaison to Development, QA and the Business. Provides scaling, design, costing, troubleshooting and impact analysis consultation.
Analyzes systems and makes recommendations to prevent possible problems. Takes lead on issue resolution activities using knowledge of complex and company-wide systems.
Leads end-to end audit of monitors and alarms based on subsystem knowledge. Takes the lead on defining the requirements for new tools required for operations.
Applies time management and project management skills to lead the resolution of issues in a timely and organized manner, effectively communicating necessary information. May consult directly with clients or third party vendors; provides domain expertise.
Consistent exercise of independent judgment and discretion in matters of significance.
Regular, consistent and punctual attendance.
Required profile
Experience
Spoken language(s):
English
Check out the description to know which languages are mandatory.