Turning Complex Data Privacy Challenges into AI-Powered Intelligence
From Concept to Pilot to Legacy – My Journey as a Co-Founder of SensID
In a world where unstructured data explodes and regulations like GDPR demand full control over personal information, I envisioned a solution to tackle this challenge head-on.
That vision became SensID — an AI/ML-powered system for detecting, classifying, and governing personal and sensitive data across enterprise environments. As co-founder and CTO, I was the driving force behind SensID’s concept, design, and initial development. Today, it lives on as part of the 4Semantic platform, but the core ideas were born from my work
The idea for SensID emerged in 2016 when I observed enterprises struggling with:
Scattered sensitive data across databases, emails, documents, APIs.
Difficulty applying rigid data rules to messy, real-world text.
Mounting GDPR requirements: data audits, user rights, consent records.
I combined my background in data systems, AI/ML, and large-scale architecture to craft a solution: a system that discovers what you didn’t know you had and helps you manage it intelligently.
SensID is a modular data intelligence platform that enables:
Smart detection of personal data using Machine Learning and Natural Language Processing (NLP).
Classification and cataloging of structured and unstructured sources, including Polish-language content.
Real-time data governance aligned with GDPR and other privacy frameworks.
Anonymization and pseudonymization, integrated with ARX privacy models.
Custom visual dashboards and API access for enterprise integration.
I served as the originator of the idea and led the project as:
Chief Architect: Designed the product structure and processing flow.
ML/NLP Engineer: Built and trained models for named entity recognition and sensitive data classification.
CTO: Managed development, integration, and team workflows.
Business Developer: Presented the MVP to early adopters and oversaw our pilot with PKP (Polish State Railways).
Later, when external investors (Rubicon Partners) stepped in, I chose to leave the venture, entrusting its continued development to my co-founder. But the architecture, concept, and product DNA remain my creation.
SensID scans and analyzes data from:
File systems (PDF, DOCX, TXT, ZIP, XML)
SQL & NoSQL databases
Email servers and content platforms
APIs and data streams (Kafka, JMS, REST)
It identifies and classifies:
Personal identifiers (name, address, phone, PESEL, IP)
Financial and health data
Sensitive content (political views, biometric data, etc.)
The results are indexed and visualized through Kibana dashboards or made available via APIs for integration with compliance, CRM, or analytics platforms.
Precision data classification with contextual NLP (Polish support included)
Full lifecycle audit & reporting for sensitive data
Anonymization support via ARX integration
GDPR compliance toolkit (e.g., consent registry, data subject rights support)
REST APIs for plug-and-play usage in enterprise systems
GDPR audits and reporting
Customer identity resolution and profiling
Data Loss Prevention (DLP) and risk monitoring
Automated classification of documents and correspondence
Semantic search and sentiment analysis across enterprise text
Although I exited before the investment phase, I proudly claim SensID as one of my most meaningful innovations. The system’s vision, architecture, and early execution were mine. The fact that it evolved into a commercial product now integrated into 4Semantic shows the durability of the concept.
SensID stands as proof of what happens when technical foresight meets regulatory pressure, and how applied AI can create real business value.
As I continue working at the intersection of AI, cloud, and enterprise architecture, SensID remains a landmark project that reflects my ability to:
Identify and address emerging needs with innovation
Build intelligent systems from scratch
Lead teams across technical and strategic dimensions