Hugging Face
The AI community building the future.
Data Infrastructure Advocate Engineer
Location
New York
Posted
31 days ago
Salary
Not specified
EnglishOpen SourcePandasPython
Job Description
• Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organize events or challenges.
• Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet.
• Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub’s value for data workflows.
• Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning.
• Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering.
• Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.
• Share insights on storage optimization, dataset versioning, and deduplication to empower developers.
• Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.
• Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.
Job Requirements
- Strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
- Hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning.
- Ability to explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.
- Active participation in developer communities (GitHub, Discord, forums) and passion for open source and knowledge sharing.
- Thrive in fast-moving environments and enjoy building in public to inspire others.
Benefits
- Health, dental, and vision benefits for employees and their dependents.
- Parental leave.
- Flexible paid time off.
- Flexible working hours and remote options.
- Reimbursement for relevant conferences, training, and education.
- Company equity as part of compensation package.