
Chi Tiết Tin Tuyển Dụng AI Engineer Tại CÔNG TY TNHH THƯƠNG MẠI DỊCH VỤ VEXERE
- Hồ Chí Minh: 384 Hoang Dieu, Ward 6, Quận 4, Quận 4
Mô Tả Công Việc AI Engineer Với Mức Lương Thỏa thuận
We are rebuilding every customer and employee touch-point as an AI-native experience. That ranges from booking a flight/bus/train seat to automating sale processes or leave requests. Engineers here get clear goals, full context, and the freedom to ship quickly while keeping reliability and safety paramount.
Major projects you will tackle:
Omni-channel customer chatbot that handles FAQs, after-sales, and ticket booking for bus, flight, train, and vehicle-rental verticals- across text and voice.
Department-specific assistants for HR onboarding, finance invoice queries, operations incident triage, IT help-desk, and more- each powered by shared LLM components and n8n triggers.
Company-wide automation hub built on n8n, including reusable nodes and guard-rails that let non-technical teams create flows safely.
Multimodal expansion that blends text, speech, and images so customers can, for example, upload a ticket photo or speak a booking change request.
You will fine-tune foundation models, fuse retrieval with LLM reasoning, and iterate in Vietnamese, English, and other languages.
What you will do:
Design, implement, and continually improve our multi-agent framework. Build and refine agent components such as short- and long-term memory stores, planning/reflection loops, agent-to-agent messaging, and specialised prompt templates that let multiple agents collaborate on complex tasks.
Select and adapt OpenAI, Gemini models, Llama, Mixtral or better- using LoRA or full fine-tuning- so the models speak our brand voice in multiple languages.
Build and optimise retrieval-augmented pipelines, keeping latency below two seconds and hallucinations under five percent.
Craft prompts, refusal rules, and jailbreak tests; automate factuality, safety, and multimodal hallucination checks in CI.
Package models with Docker and serve them through FastAPI or gRPC behind vLLM, Triton Inference Server, or Text-Generation-Inference; add monitoring with Grafana / Prometheus for latency, drift, and GPU cost (for later milestones).
Pair daily with Conversation Designers, MLOps engineers, Automation engineers, and business stakeholders; publish model cards, data sheets, and rollback plans.
Với Mức Lương Thỏa thuận Thì Cần Những Yêu Cầu Công Việc Gì
Practical knowledge of vector databases such as milvus, pgvector, Qdrant, or Pinecone and hybrid-search techniques.
Demonstrated ability to raise containment or reduce hallucinations through data-driven experiments.
Solid engineering habits: Git, code reviews, unit and integration tests, CI/CD pipelines.
Clear spoken and written English; Vietnamese fluency is a bonus.
Nice-to-haves:
Hands-on experience with Rasa, Dialogflow CX, or ASR/TTS pipelines for voice bots.
Deep GPU-performance tuning, including quantisation, KV-cache optimisation, or custom Triton kernels.
Familiarity with multi-agent frameworks such as PydanticAI, CrewAI, LangGraph or others.
A privacy-by-design mindset and working knowledge of GDPR or PDPA compliance.
Tại CÔNG TY TNHH THƯƠNG MẠI DỊCH VỤ VEXERE Thì Được Hưởng Những Gì
Hybrid Working
Clear career progression with opportunities for advancement to key positions based on your capabilities.
Vibrant and dynamic working environment with a friendly and supportive team that shares knowledge and assists each other.
Training and development opportunities in negotiation, communication, work management, interpersonal skills, and software technology.
Free parking and allowances: Marriage, Newborn baby and others are applied.
A spacious pantry fully equipped with a coffee maker, microwave, milk , tea and more
Cách Thức Ứng Tuyển Tại CÔNG TY TNHH THƯƠNG MẠI DỊCH VỤ VEXERE
Ứng viên nộp hồ sơ trực tuyến bằng cách bấm Ứng tuyển ngay dưới đây.
Hạn nộp hồ sơ: 22/06/2025
Nộp hồ sơ ứng tuyển tại job3s.online
