Charles Camp
Verified Expert in Engineering
Machine Learning Developer
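Charles holds certifications in both artificial intelligence and data science. He excels at building high-performing models and making them easy to use. He adapts well to different environments, having worked in banks, startups, IT firms, and laboratories. His areas of expertise are natural language processing and time series analysis.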
Preferred Environment
Python, Amazon Web Services (AWS), Natural Language Processing (NLP), Time Series Analysis, Transformers, Reinforcement Learning
The most amazing...
...model I've built identifies people linked to financial crime.
Work Experience
AI Developer
Non-Fungible Films, Inc.
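- Fine-tuned Stable Diffusion models for use with the company's virtual characters.
- Deployed a Discord bot similar to Midjourney, but backed by custom Stable Diffusion models.
- Integrated the models with a Stable Diffusion UI to enable drawing, image to image, and other applications.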
ML Engineer
Global CPG Company
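- Created a pipeline that automatically computes lookalike audiences from in-house consumer behavior data.
- Compared models and tuned hyperparameters to achieve the highest performance.
- Created custom PySpark and scikit-learn estimators to integrate with PySpark and scikit-learn pipelines, respectively.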
ML and NLP Engineer
Phragmites, Inc.
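- Set up an EC2 server, analyzed Telegram messages stored on a Postgres DB, and classified them as relevant or irrelevant to specific crypto-related projects.
- Built a bot message detection model using a near-duplicate clustering approach.
- Used graph theory to quantify the influence of Telegram users in crypto-centric conversations.
- Trained an NER model to detect crypto project names in Telegram messages.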
Senior Data Scientist
Trust & Safety Laboratory
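- Trained machine learning models to spot controversial topics in tweets. Controversial topics are defined as those likely to contain harmful misinformation.
- Trained ML models to detect false claims and misinformation in tweets.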
- Built a pipeline to collect human loop reviews (AWS), automated the labeling of potentially misleading tweets, and performed website scraping.
- Developed a serverless framework to automate social media screening tasks.
Python Developer | AI
Click Factura SA de CV
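- Transcribed and summarized Spanish audio meetings: fine-tuned speech-to-text models (DeepSpeech, NeMo, and Wav2Vec) and used text summarization and speaker diarization models.
- Trained an OCR model to extract information from Mexican tickets.
- Integrated the models into an existing Django application by creating APIs.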
- Deployed the models using Docker containers and Flask.
Machine Learning Expert | Digital Advertisement
Primal Analytics
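- Deployed a Lambda function that automatically detects anomalies in Google Ads statistics.
- Compared various state-of-the-art ML models for time series anomaly detection.
- Set up the AWS account used for data storage and Lambda execution.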
Senior Data Scientist
Glovo
- Designed, implemented, and deployed a customer lifetime value model. We deployed it on an EC2 instance using Luigi and scheduled it with Jenkins.
- Used linear programming to optimize pickers' time shifts.
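- Built an end-to-end pipeline that decides whether to show a product in the app based on the probability that it is available in the store, improving the customer experience. The model was trained on SageMaker and then deployed on an EC2 instance.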
Data Scientist
Credit Suisse
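- Designed and deployed machine learning models that detect money laundering using transaction data.
- Led the negative news screening project, which automatically screens news data for links to financial crime in order to enrich the risk scoring model.
- Used NLP to measure the impact of news data on the sales of financial products.
- Organized the data sourcing and mapping of various transaction and KYC data sources on the big data platform. Also handled the design and implementation of the data model for transaction and KYC data to facilitate transaction monitoring.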
Research Scholar
Carnegie Mellon University
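- Designed and implemented a model that predicts patient survival after cardiac arrest using the patients' brain activity data (multivariate time series).
- Built an evaluation metric that gives a better score to models that predict survival early.
- Clustered patients to identify common characteristics and infer specific preventive measures to improve their survival rate.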
Data Scientist Intern
Capgemini
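- Set up a Spark cluster to read sensor data from HDFS and preprocess it.
- Built a scalable supervised model that detects manufacturing breakdowns using multivariate time series data (sensor data).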
- Fine-tuned and validated the model. Identified main features leading to breakdowns.
Experience
Recommender System
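In the first step, we use non-negative matrix factorization (NMF) to find two matrices W and H, of respective sizes (number of users, K) and (K, number of movies), that minimize the difference between V and WH, where V is the (number of users, number of movies) matrix and K is a small value (< 10). In other words, we look for W and H such that WH is close to V.
Afterward, we use W and H to cluster the users, and we can then recommend to each user the movies that their assigned cluster is likely to enjoy.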
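The two steps above can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the project's actual code: the toy matrix V, the function names nmf and recommend, the multiplicative update rule, and the choice to assign each user to the cluster of their dominant latent factor are all assumptions made for the example. Unrated movies are simply treated as zeros, which is a simplification.

```python
import numpy as np


def nmf(V, k, n_iter=500, eps=1e-9, seed=0):
    """Factor V (users x movies) into non-negative W (users x k) and H (k x movies)."""
    rng = np.random.default_rng(seed)
    n_users, n_movies = V.shape
    W = rng.random((n_users, k)) + eps
    H = rng.random((k, n_movies)) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep W and H non-negative while
        # reducing the Frobenius error ||V - WH||.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def recommend(V, W, H, user, top_n=2):
    """Return indices of unrated movies that the user's cluster scores highest."""
    # Assumption: a user's cluster is the index of their largest latent factor.
    clusters = W.argmax(axis=1)
    members = clusters == clusters[user]
    # Mean reconstructed rating over all users in the same cluster.
    cluster_scores = (W[members] @ H).mean(axis=0)
    # Exclude movies the user has already rated.
    cluster_scores[V[user] > 0] = -np.inf
    ranked = np.argsort(cluster_scores)[::-1]
    n_unrated = int((V[user] == 0).sum())
    return ranked[: min(top_n, n_unrated)].tolist()


# Hypothetical toy ratings: rows are users, columns are movies, 0 means unrated.
V = np.array([
    [5, 4, 0, 0, 1, 0],
    [4, 5, 5, 0, 0, 1],
    [0, 1, 0, 5, 4, 5],
    [1, 0, 0, 4, 5, 4],
], dtype=float)

W, H = nmf(V, k=2)
print("reconstruction error:", np.linalg.norm(V - W @ H))
print("recommendations for user 0:", recommend(V, W, H, user=0))
```

In practice, a library implementation such as scikit-learn's NMF combined with a dedicated clustering step on the rows of W would be a reasonable alternative to the hand-written updates shown here.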
Face and Image Recognition
Skills
Languages
Python, SQL, R
Libraries/APIs
Scikit-learn, Pandas, PySpark, SpaCy, Natural Language Toolkit (NLTK), XGBoost, TensorFlow, Luigi, OpenCV, Node.js
Paradigms
Data Science, Anomaly Detection, Linear Programming, Test Automation
Other
Time Series Analysis, Natural Language Processing (NLP), Machine Learning, Artificial Intelligence (AI), Communication, Generative Pre-trained Transformers (GPT), Neural Networks, ARIMA Models, Sentiment Analysis, Cryptocurrency, Hugging Face, Analysis of Variance (ANOVA), APIs, Speech to Text, OCR, Decentralized Finance (DeFi), Trend Analysis, Digital Advertising, Computer Vision, Reinforcement Learning, PEFT, LoRA, Transformers
Platforms
Linux, Amazon Web Services (AWS), Docker, Kubernetes
Frameworks
Django
Tools
Amazon SageMaker, Bazel
Storage
Redshift, Google Cloud, Elasticsearch
Education
Master's Degree in Data Science
Grenoble Institute of Technology - Grenoble, France
Bachelor's Degree in Computer Science
Grenoble Institute of Technology - Grenoble, France
Certifications
Generative AI with Large Language Models
Coursera
Decentralized Finance (DeFi)
Coursera
AWS Solutions Architect Associate
Pearson VUE
Django for Everybody
University of Michigan | via Coursera