Hugging Face

The AI community building the future.

Data Infrastructure Advocate Engineer

Full TimeRemoteTeam 51-200Since 2016H1B SponsorCompany SiteLinkedIn

Location

New York

Posted

31 days ago

Salary

Not specified

EnglishOpen SourcePandasPython

Job Description

• Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organize events or challenges. • Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet. • Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub’s value for data workflows. • Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning. • Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering. • Produce high-quality tutorials, blog posts, and videos that make complex topics accessible. • Share insights on storage optimization, dataset versioning, and deduplication to empower developers. • Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration. • Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.

Job Requirements

  • Strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
  • Hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning.
  • Ability to explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.
  • Active participation in developer communities (GitHub, Discord, forums) and passion for open source and knowledge sharing.
  • Thrive in fast-moving environments and enjoy building in public to inspire others.

Benefits

  • Health, dental, and vision benefits for employees and their dependents.
  • Parental leave.
  • Flexible paid time off.
  • Flexible working hours and remote options.
  • Reimbursement for relevant conferences, training, and education.
  • Company equity as part of compensation package.

Related Categories

Related Job Pages