Metadata/data identification Java library. Identifies Base Type (e.g. Boolean, Double, Long, String, LocalDate, LocalTime, ...) and Semantic Type information (e.g. Gender, Age, Color, Country, ...). Extensive country/language support. Extensible via user-defined plugins. Comprehensive Profiling support. Large set of built-in Semantic Types (extensible via JSON defined plugins). Extensive Profiling metrics (e.g. Min, Max, Distinct, signatures, …) Sufficiently fast to be used inline. See Speed notes below. Minimal false positives for Semantic type detection. See Performance notes below. Usable in either Streaming, Bulk or Record mode. Broad country/language support - including US, Canada, Mexico, Brazil, UK, Australia, much of Europe, Japan and China. Support for sharded analysis (i.e. Analysis results can be merged) Once stream is profiled then subsequent samples can be validated and/or new samples can be generated.

Features

  • Large set of built-in Semantic Types (extensible via JSON defined plugins)
  • Extensive Profiling metrics (e.g. Min, Max, Distinct, signatures, …)
  • Minimal false positives for Semantic type detection
  • Usable in either Streaming, Bulk or Record mode
  • Broad country/language support - including US, Canada, Mexico, Brazil, UK, Australia, much of Europe, Japan and China
  • Support for sharded analysis (i.e. Analysis results can be merged)

Project Samples

Project Activity

See All Activity >

Categories

Data Profiling

License

Apache License V2.0

Follow Semantic Type Detection

Semantic Type Detection Web Site

Other Useful Business Software
Easily Host LLMs and Web Apps on Cloud Run Icon
Easily Host LLMs and Web Apps on Cloud Run

Run everything from popular models with on-demand NVIDIA L4 GPUs to web apps without infrastructure management.

Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure. Cloud Run gives you on-demand GPU access for hosting LLMs and running real-time AI—with 5-second cold starts and automatic scale-to-zero so you only pay for actual usage. New customers get $300 in free credit to start.
Try Cloud Run Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Semantic Type Detection!

Additional Project Details

Programming Language

Java

Related Categories

Java Data Profiling Tool

Registered

2023-06-12