Data Acquisition & Enrichment Specialist

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Bachelor's degree in computer science, information systems, or a related field; a Master's degree is a plus., Extensive experience in Python programming for scripting and data processing., Hands-on experience with web scraping libraries such as BeautifulSoup, Scrapy, and Selenium., Solid understanding of data integrity principles and experience with data normalization and deduplication techniques..

Key responsibilities:

  • Use Python scripts and scraping tools to gather business and contact information for the CRM.
  • Clean and normalize collected data to ensure consistency and remove duplicates.
  • Enrich existing CRM records by adding missing attributes from external datasets.
  • Continuously monitor and improve the integrity of CRM data and set up automated workflows for data updates.

HireWithNear logo
HireWithNear Human Resources, Staffing & Recruiting Startup https://hirewithnear.com/
51 - 200 Employees
See all jobs

Job description

Data Acquisition & Enrichment


At Near, we help top talent in Latin America find remote roles with US companies. Our mission is to create better lives by fostering a remote work culture that transcends borders.


About this opportunity:

We are seeking a Data Acquisition & Enrichment Engineer to join our Business Intelligence team. This remote contractor will own the end-to-end process of identifying, extracting, and enriching business and contact data to keep our CRM system comprehensive and accurate.

The ideal candidate is an expert in Python-based web scraping and data cleaning, with a laser focus on execution speed and data integrity. This role does not involve building dashboards or front-end reporting; instead, it centers on back-end data acquisition and maintenance at scale.

Key Responsibilities:

  • Web Data Extraction: Use Python scripts and scraping tools to gather business and contact information from various sources (websites, LinkedIn, databases, APIs) to fill gaps in our CRM. This includes writing and maintaining scrapers (BeautifulSoup, Scrapy, Selenium, Puppeteer, etc.) for continuous data acquisition.
  • Data Cleaning & Deduplication: Clean and normalize the collected data to ensure consistency in formatting (e.g. standardizing company names, titles) and remove duplicates. Implement deduplication rules and normalization processes to maintain high data quality in the CRM.
  • CRM Data Enrichment: Enrich existing CRM records by adding missing attributes (such as industry, employee count, latest contact info, etc.) from external datasets. Data enrichment is the process of adding information to customer data to build richer profiles, which also helps spot outdated or incorrect info. The engineer will update CRM entries via APIs or database loads, ensuring seamless integration.
  • Data Integrity & Maintenance: Continuously monitor and improve the integrity of the CRM data. Identify outdated entries (e.g. people who changed jobs) and update them. Set up automated workflows or schedules for periodic re-scraping and data refresh so information stays current.
  • System Development for Scale: Design and build systems or pipelines to automate data updates at scale. This may involve creating intermediate databases (e.g. MySQL) to store raw scraped data and transformation scripts to process it before updating the CRM. Over time, develop a scalable architecture for continuous data crawling and enrichment.
  • Cross-Team Collaboration: Work closely with the BI Director and sales/marketing teams to understand data needs and priorities. Proactively communicate insights about data gaps or quality issues and propose solutions. Operate with high ownership – manage your projects end-to-end and ensure timely delivery of clean data to stakeholders.

Skills/Experience Needed:

  • Strong Python Skills: Extensive experience in Python programming for scripting and data processing. Must be comfortable building robust web scrapers and data pipelines in Python.
  • Web Scraping Tools: Hands-on experience with web scraping libraries/frameworks such as BeautifulSoup, Scrapy, Selenium, Puppeteer, etc., and understanding of how to scrape data from various formats (HTML, JSON APIs, PDFs). Familiarity with handling anti-scraping measures (IP rotation, captchas) is a plus.
  • Database Proficiency: Experience with MySQL or similar relational databases to store and query data. Ability to write SQL queries and design simple schemas for holding scraped/raw data.
  • Data Quality & Cleaning: Solid understanding of data integrity principles – ensuring completeness, accuracy, and consistency of data. Experience with data normalization and deduplication techniques to merge and purify datasets (for example, cleaning up multiple entries for the same entity).
  • CRM Data Model Familiarity: Basic knowledge of CRM systems (Salesforce, HubSpot, or others) and their data structures for companies/contacts. While direct CRM administration is not required, the candidate should understand how enriched data will map into CRM fields and objects.
  • Autonomous & Proactive: Highly proactive, “get-things-done” attitude. Able to work with minimal supervision, set own targets, and aggressively deliver results. Treats goals like missions – with speed, precision, and full ownership. Comfortable in a high-velocity environment where priorities can shift, and able to adapt and respond quickly.
  • Problem-Solving: Resourceful in finding data and overcoming obstacles (e.g., figuring out alternative data sources or scrape methods if one path is blocked). Strong analytical and troubleshooting skills to debug scraping issues or data discrepancies.
  • Communication: Fluent in English (written and spoken) for effective remote collaboration. Able to clearly document processes, report progress, and communicate findings or issues to a non-technical audience (e.g., sales team) when needed

Education Requirements:

  • Bachelor's degree in computer science, information systems, or a related field.
  • Master’s degree is a plus

What's in it for you?

  • PTO
  • US Holidays
  • 100% remote work

We consider candidates from all around LATAM! This is a full-time, remote position to work with the USA. Sounds like you? Let's see your application!


Required profile

Experience

Industry :
Human Resources, Staffing & Recruiting
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Proactivity
  • Communication
  • Problem Solving

Data Analyst Related jobs