Software engineer, data acquisition @ Soniox d.o.o.
Job description
At Soniox, our mission is to make voice universally accessible and programmable in real time. Our models depend on vast, diverse, and high-quality datasets to train state-of-the-art AI systems across languages and domains. As a software engineer working on data acquisition, you will lead the development of scalable infrastructure for acquiring, indexing, and managing data at massive scale, powering the next generation of speech and language models.
What we require from candidates
In this role, you will:
- Design, build, and scale systems for web crawling, large-scale data ingestion, and content indexing.
- Work closely with data processing and model training teams to ensure smooth and efficient data pipelines.
- Own backend infrastructure for storage, indexing, and search across multi-petabyte datasets.
- Architect distributed systems that are robust, performant, and optimized for research and production workloads.
- Deploy and operate services in a Kubernetes environment using Infrastructure-as-Code.
- Analyze system performance and data coverage through experimentation and instrumentation.
You might thrive in this role if you:
- Have 6+ years of experience building large-scale software systems.
- Have deep knowledge of distributed systems, web crawling, and backend engineering.
- Are comfortable with key-value stores, data synchronization, and scalable storage systems.
- Are pragmatic and curious, unafraid to try new tools and rethink old assumptions.
- Communicate clearly and proactively, especially across cross-functional teams.
- Care about building infrastructure that directly powers real-world AI systems.
What we offer
- The chance to work on foundational AI that redefines how humans and machines communicate.
- Global impact: your work will touch millions (and soon billions) of people across languages and cultures.
- End-to-end ownership in a lean, engineering-driven team with no bureaucracy.
- Collaboration with world-class talent in research, engineering, and product.
- A fast-growing startup environment where you shape both the technology and the company's future.
- Competitive compensation with equity ownership.
- Flexible work setup with emphasis on in-person collaboration.
- Regular team events, offsites, and a strong learning-driven culture.
Contact
https://soniox.com/careers/software-engineer-data-acquisition
Job classification
- Location:
- Ljubljana
- Salary:
- €3000 - €6000 gross per month, plus equity and performance bonus
- Working hours:
- regular employment
Required skills
- Design, build, and scale systems for web crawling, large-scale data ingestion, and content indexing.
- advanced knowledge
About the company
Soniox is building the world's most advanced real-time conversational AI platform, designed to understand every conversation, in every language, anywhere. Our technology goes beyond speech-to-text: we deliver low-latency transcription, translation, and reasoning across 60+ languages, enabling businesses and developers to build next-generation products powered by voice.
We are innovating across the full stack of foundational AI for speech: from large-scale data acquisition and unsupervised dataset generation, to novel model architectures, training methodologies, and optimized inference engines. This holistic approach gives Soniox unmatched accuracy and efficiency compared to Google, OpenAI Whisper, AWS, and others.
Our API powers a wide range of applications: from live translation and meeting assistants to healthcare, call centers, enterprise productivity, and accessibility solutions.
Soniox is a fast-growing startup backed by global enterprise partners. Our engineering and product development hub is in Ljubljana, Slovenia, and we are expanding internationally. Our mission is simple but ambitious: make voice AI work for 8 billion people.
Why would I want to work for you
At Soniox you don't just work with AI, you create the AI that the world depends on:
- Foundation of communication: We're not just another AI company; we're building how the world will talk to machines and to each other.
- True innovation: We invent at every layer: data, model architecture, training, and inference. We are not just wrapping someone else's model.
- Global impact: Our mission is voice AI for 8 billion people, including regions that never had reliable AI before.
- State of the art: We beat Google, OpenAI Whisper, AWS, and others. You'll work on the very frontier.
- Ownership & speed: Lean, fast, engineering-driven. You ship real innovations directly to users without red tape.
- World-class team: Our hub in Ljubljana, Slovenia is packed with talent obsessed with breakthrough AI. You'll work closely with founders and researchers.
- Generational opportunity: Speech is the next computing platform. Join now to help shape that future.
Developer questionnaire
- We use source control software
- We use a bug tracking solution (bug database)
- We use the best tools available on the market
- There is a development schedule
- We program according to a written specification
- We fix bugs before writing new code
- We employ beta testers
- Unit testing
- Employees have a quiet working environment
- Job candidates write code during interviews
- We provide employees with a warm meal at least once a day
- We provide employees with a lunch break area
- We offer employees leisure activities outside working hours
- We provide employees with a parking space