Job description
Job Responsibilities:
1️⃣ Responsible for building a sustainable and iterative intelligent dialogue system.
2️⃣ Develop multi-turn dialogue management and context understanding modules to enhance interaction effects in complex scenarios.
3️⃣ Establish user profiles and memory mechanisms to support personalized recommendations and differentiated services.
4️⃣ Build a collaborative reasoning architecture between large models and lightweight models to achieve a balance between performance and cost.
5️⃣ Develop intelligent applications and plugins based on the Dify platform, promoting modularization and rapid iteration of business modules.
6️⃣ Integrate and optimize mainstream large language models (such as GPT-5, Claude, DeepSeek) to enhance system intelligence.
7️⃣ Deploy medium and small-scale models in edge computing and resource-constrained environments to ensure inference efficiency.
8️⃣ Promote multimodal interaction, covering various input and output methods such as text, voice, and images.
Job Requirements:
1️⃣ Over 3 years of development experience, capable of independently completing project delivery.
2️⃣ Experience in developing intelligent dialogue systems or intelligent customer service.
3️⃣ Proficient in #Python, with experience in distributed and high-concurrency system architecture.
4️⃣ Practical delivery and maintenance experience of large-scale intelligent customer service or dialogue systems, ensuring system stability and high availability.
5️⃣ Familiar with conversation memory mechanisms and context management (such as long-term and short-term memory, RAG), capable of supporting complex dialogue scenarios.
6️⃣ Proficient in using the Dify platform for application development and workflow orchestration, with modular design capabilities.
7️⃣ Experience in calling, tuning, and multimodal integration of large language model APIs.
8️⃣ Mastery of prompt design and context engineering, able to apply few-shot reasoning and chain reasoning methods.
9️⃣ Experience in optimizing and deploying medium and small-scale models.
🔟 Understanding of communication protocols such as MCP and A2A, capable of supporting cross-system integration.
1️⃣1️⃣ Familiar with RESTful / GraphQL API design, mastering microservices architecture and containerized deployment.
1️⃣2️⃣ Experience in high-concurrency architecture optimization, capable of system monitoring and resource scheduling.
1️⃣3️⃣ Technical stack includes: LangChain, Transformers, Milvus, Dify, Python (FastAPI), PostgreSQL, Redis, Kafka, Docker/Kubernetes.
