Metadata/data identification Java library. Identifies Base Type (e.g. Boolean, Double, Long, String, LocalDate, LocalTime, ...) and Semantic Type information (e.g. Gender, Age, Color, Country, ...). Extensive country/language support. Extensible via user-defined plugins. Comprehensive Profiling support. Large set of built-in Semantic Types (extensible via JSON defined plugins). Extensive Profiling metrics (e.g. Min, Max, Distinct, signatures, …) Sufficiently fast to be used inline. See Speed notes below. Minimal false positives for Semantic type detection. See Performance notes below. Usable in either Streaming, Bulk or Record mode. Broad country/language support - including US, Canada, Mexico, Brazil, UK, Australia, much of Europe, Japan and China. Support for sharded analysis (i.e. Analysis results can be merged) Once stream is profiled then subsequent samples can be validated and/or new samples can be generated.

Features

  • Large set of built-in Semantic Types (extensible via JSON defined plugins)
  • Extensive Profiling metrics (e.g. Min, Max, Distinct, signatures, …)
  • Minimal false positives for Semantic type detection
  • Usable in either Streaming, Bulk or Record mode
  • Broad country/language support - including US, Canada, Mexico, Brazil, UK, Australia, much of Europe, Japan and China
  • Support for sharded analysis (i.e. Analysis results can be merged)

Project Samples

Project Activity

See All Activity >

Categories

Data Profiling

License

Apache License V2.0

Follow Semantic Type Detection

Semantic Type Detection Web Site

Other Useful Business Software
Build AI Apps with Gemini 3 on Vertex AI Icon
Build AI Apps with Gemini 3 on Vertex AI

Access Google’s most capable multimodal models. Train, test, and deploy AI with 200+ foundation models on one platform.

Vertex AI gives developers access to Gemini 3—Google’s most advanced reasoning and coding model—plus 200+ foundation models including Claude, Llama, and Gemma. Build generative AI apps with Vertex AI Studio, customize with fine-tuning, and deploy to production with enterprise-grade MLOps. New customers get $300 in free credits.
Try Vertex AI Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Semantic Type Detection!

Additional Project Details

Programming Language

Java

Related Categories

Java Data Profiling Tool

Registered

2023-06-12