nlp

职责

——负责电子商务场景中与文字内容（如产品说明、图片文字、口头广播说明等）相关的风险识别模型

-负责产品/短视频/直播评论、报道等舆情内容的风险挖掘算法，提高平台治理效果

——构建行业领先的内容识别方法，构建具有对抗行为的文本内容的跨域、多维识别策略

-探索NLP相关前沿技术，并将其应用于电子商务业务场景

-支持生成可扩展和优化的AI/机器学习（ML）模型

-专注于构建用于提取、转换和加载大量实时非结构化数据的算法，以从理论数据科学模型中部署AI/ML解决方案。

-运行实验来测试部署的模型的性能，并识别和解决过程中出现的错误。

-在团队环境中工作，应用公司要求的统计、脚本和编程语言知识。

-与部署模型的相关软件平台合作。

资格

-扎实的自然语言处理算法基础，对文本分类、相似度匹配、对话问答、机器翻译、序列标注、知识图谱、意图理解、词义消歧等领域有深入的理解和实践经验

——熟悉常用的机器学习和深度学习算法，了解基本的网络模型结构（DNN/LSTM/CNN等）和文本表示方法（LDA/Word2Vec/ELMo/GPT/BERT等），对深度学习训练和推理模型调优有实际经验

-动手能力强，熟练使用至少一个主流深度学习框架（TensorFlow/PyTorch/Caffe/MXNet），了解分布式训练、提炼加速等实现方法

Responsibilities

- Responsible for the risk identification model related to the text content (such as product description, picture text, oral broadcast description, etc.) in the e-commerce scene

- Responsible for product/short video/live comments, reports and other public opinion content risk mining algorithm, improve the platform governance effect

- Build an industry-leading content recognition method, and build a cross-domain and multi-dimensional recognition strategy for text content with adversarial behavior

- Explore NLP-related cutting-edge technologies and apply them to e-commerce business scenarios

- Support the production of scalable and optimised AI/machine learning (ML) models

- Focus on building algorithms for the extraction, transformation and loading of large volumes of realtime, unstructured data to deploy AI/ML solutions from theoretical data science models.

- Run experiments to test the performance of deployed models, and identifies and resolves bugs that arise in the process.

- Work in a team setting and apply knowledge in statistics, scripting and programming languages required by the firm.

- Work with the relevant software platforms in which the models are deployed.

Qualifications

- Solid foundation of NLP algorithm, in-depth understanding and practical experience in text classification, similarity matching, dialogue question and answer, machine translation, sequence tagging, Knowledge Graph, intention understanding, word meaning disambiguation and other fields

- Familiar with commonly used machine learning and deep learning algorithms, understand the basic network model structure (DNN/LSTM/CNN, etc.) and text representation methods (LDA/Word2Vec/ELMo/GPT/BERT, etc.), and have practical experience in deep learning training and reasoning model tuning

- Strong hands-on ability, proficient in using at least one mainstream deep learning framework (TensorFlow/PyTorch/Caffe/MXNet), and understanding of distributed training, distillation acceleration and other implementation methods

- Excellent problem analysis and problem solving skills, have certain processing methods and optimization experience on domain migration, small sample construction, text mining, unsupervised/semi-supervised and other issues

- Master basic big data related components (Hadoop/Spark/Hive/Flink), and have experience in large-scale text data processing and cleaning

- Familiar with Linux development environment, proficient in C++ /go/python at least one programming language, and solid foundation in algorithms and data structures

- Good sense of teamwork and communication skills, practical experience in relevant business scenarios is preferred

- Published papers in high-level computer science conferences (ACL, EMNLP, NIPS, AAAI, etc.) or have competition experience are preferred

公司地点：北京海淀黄庄[地铁站]

公司简介：

发光如星是一家有着独特基因的猎头公司，顾名思义，公司希望每一个发光如星的员工都可以煜煜生辉，同样也希望整个公司能在互联网这个行业中绽放自己的光芒；在其他同业公司都在追求快速扩张的时候，我们在努力的夯实公司发展的底层地基。更关注员工的个人成长和组织的核心稳定性，本着对客户和顾问负责的态度，稳扎稳打，不断打磨自己的培训体系；为行业培养了越来越多的优秀顾问

nlp

公司地点：北京海淀黄庄[地铁站]

公司简介：

职位发布者：贺女士

融资阶段：

公司规模：20~99人

相似职位：