Patrick Chan’s Post

Just concluded 1st-place in Kaggle with a great team consisting of Geremie Yeo, Gaurav Rawat, Young Min Paik, and Yevhenii Maslov in the PII Data Detection Competition, competing against over 2000 teams. Many thought NER is a simple task, not when you are faced with real world cases. See that Presidio just do not cut the task alone :) Our approach combined post-processing, ensemble techniques, and the training of transformer models like Deberta, Longformer and other models across various architectures, such as multi-dropout and BiLSTM, and knowledge distillation. Solution Write-up: https://lnkd.in/g5rzkiMx We continue to investigate various emerging LLM architectures and found that code pretrained LLMs are great at identifying NER too. Due to time constraints and work commitment, we have yet to integrate that to our solution. But the ensembles done by the team is already strong enough to land 1st. Cheers to the good team effort!

  • No alternative text description for this image
Mingjie Wang

Quant | Kaggle Competition Master

4mo

Congratulations 

Like
Reply
Fethi Filali, PhD

Director of Technology & Research | R&D and Innovation in Applied AI & Smart Cities | Investor | Full Stack AI/ML Expertise | Lifelong & Hands-on Learner | 8 years as Prof. + 15+ in Industry | Public Speaker

3mo

Congrats to you and the team, Patrick Chan! We've been testing with NER for some time, and I completely agree with you that it's not a simple task, especially when considering real-world use cases and trying to augment existing products! 👏

Congratulations Dr.

Like
Reply
Yeongung Seo

Senior Researcher in MLLM, VLM (Kaggle Master)

4mo

Congratulations !

Jitendra Upadhyay

Director lead model developer

2mo

Congrats Dr Patrick

Like
Reply
Rohit Pattnaik

Senior Data Scientist at ExxonMobil | ChatGPT | AI

3mo

Congrats Dr. Patrick ... Great going!

Like
Reply
Tushar Sankhe

Global Practice Leader-Analytics COE and Artificial Intelligence Practice | Digital Transformation| Cloud Platforms | Data & AI I Machine Learning | Products | Innovation | Thought Leader | Responsible AI , Ethics

3mo

Great effort indeed ! Congratulations Dr. Patrick and team.

Like
Reply
Saravanan Rajamanickam

Lead Data Scientist II @ A*STAR | Expert in Generative AI, NLP, LLM

4mo

Congratulations Dr Patrick!

Like
Reply
Benjamin K.

Director at Amaris.AI

4mo

Congratulations Dr Patrick!

Like
Reply
See more comments

To view or add a comment, sign in

Explore topics