Access Governance Scan Scaling
Re-architected access scanning for SMB, OneDrive, and Microsoft API sources around parallel batch processing and a throughput-focused backend design.
Increased SMB and OneDrive scan throughput by 22x, from 2M to 45M records per day, and Microsoft API throughput by 4x, from 1M to 4M records per day.
22x SMB/OneDrive
45M records/day
4x Microsoft APIs
Parallel batching · Access governance · Throughput optimization · Production scaling
Actual Access Audit-Log Pipeline
High-volume real-time ingestion pipeline for audit logs from SharePoint, Google Drive, and NetApp.
Created the data stream used as a foundation for downstream ransomware detection and automated response features.
Real-time logs
3 major sources
Detection foundation
SharePoint · Google Drive · NetApp · Audit logs · Ransomware detection
Enterprise Data-Source Integrations
Built the primary enterprise data-source connectors from zero to one and maintained them in production, including Azure Blob, Salesforce, Box, and Dropbox.
Expanded the platform's data coverage while keeping integrations maintainable, scalable, and production-ready.
10+ integrations
Zero-to-one builds
Production ownership
Azure Blob · Salesforce · Box · Dropbox · Cloud connectors
Lightbeam Lens Analytics Ingestion
Backend ingestion service for Lightbeam Lens, using ClickHouse to serve enterprise analytics over file metadata and lineage.
Built optimized ingestion and views that help visualize metadata relationships and lineage across enterprise storage systems.
ClickHouse
Metadata lineage
Enterprise analytics
Lightbeam Lens · ClickHouse · Optimized views · File metadata · Lineage
Quickget Marketplace & Operations Automation
Bi-directional Uber Eats integration plus operational tooling across Google Maps, Airtable, and Slack.
Enabled real-time inventory syncing and automated order ingestion, opening a new sales channel and increasing daily order volume by 30%.
30% order lift
Real-time inventory
Automated ingestion
Uber Eats · Google Maps · Airtable · Slack · Campaign services
Google Summer of Code 2020 · OpenMined
Implemented the FV homomorphic encryption scheme from scratch in PySyft and improved usability for privacy-preserving machine learning workflows.
Used Microsoft's SEAL project as a reference and shaped the work into a library that PyTorch and TensorFlow users can adopt more easily.
5 months
FV scheme
Deep learning integration
PySyft · OpenMined · Homomorphic encryption · PyTorch · TensorFlow · Microsoft SEAL