Semantic ID Generator - NPM Package

Contents
- Introduction
- Quick Start
- Key Features
- Domain Presets & Schema Export
- Usage & Configuration
- Testing & Performance
- Documentation Map
- License
- About the Author
Introduction
Semantic ID Generator is a Node.js package for minting human-readable, machine-understandable identifiers composed of configurable “compartments.” Each compartment has a semantic meaning and a generation strategy so IDs stay unique, recognizable, and consistent across systems.
Latest Updates (v1.3.0)
- ✅ Full-plane Unicode IDs – printable code points from the entire Unicode range.
- ✅ Safer Passphrases – cached word lists and automatic separator filtering.
- ✅ Cleaner TypeScript Tooling – dedicated
ts-nodebootstrap script. - ✅ Expanded Test Coverage – numeric entropy, Unicode behavior, performance thresholds.
- ✅ Domain Presets & Graph Schemas – 30 ready-made configurations plus JSON-LD/OWL exports.
Quick Start
Install
npm install semantic-id-generator
Basic usage
import SemanticIDGenerator from 'semantic-id-generator';
const generator = new SemanticIDGenerator();
const id = generator.generateSemanticID('person');
console.log(id);
Runnable Code Samples
Every major feature now has a self-contained sample under code_samples/:
- Basic ID generation (default + custom configs)
- String strategies (Base64 payloads, language-scoped passphrases)
- Domain presets (preset hydration + metadata inspection)
- Schema export (JSON-LD and OWL artifacts)
- TypeScript tooling (builder pattern + runtime guards)
Each folder contains a README.md plus a runnable script, e.g.
node code_samples/domain-presets/generate-contract-id/sample.js
node --import ./scripts/register-ts-node.mjs code_samples/typescript-tooling/builder-pattern/sample.ts
MCP Integration (Server + Client)
Need to let AI copilots such as Cursor or Claude invoke the Semantic ID Generator directly? A dedicated MCP companion package now lives under packages/semantic-id-generator-mcp. It keeps the core library lean (no MCP dependencies) while providing:
semantic-id-generator-mcp-server– stdio MCP server exposing tools to generate IDs, inspect IDs, and list presets.semantic-id-generator-mcp-client– a minimal stdio client (inspired by the official MCP tutorials) for local testing or scripted automation.
Usage
cd packages/semantic-id-generator-mcp
npm install
# Start the stdio server (register this command inside Cursor/Claude)
npx --package semantic-id-generator-mcp semantic-id-generator-mcp-server --default-preset dataset
# Optional: drive the tools from a terminal shell
npx semantic-id-generator-mcp-client --tool generate-semantic-id --args '{"dataConceptName":"contract","preset":"contract"}'
The server follows the latest MCP specification so any compatible AI model client can connect, discover the tools, and receive structured responses.
Testing
# Inside packages/semantic-id-generator-mcp
npm test # runs in-memory + stdio MCP integration tests
The MCP suite covers protocol-level behavior (tool/resource calls) and also spawns the actual stdio server to verify initialization works exactly the way Cursor or Claude would invoke it.
Key Features
- Configurable compartments – mix strategies (visible, numeric, base64, passphrase, etc.).
- 30 domain presets – hydrate a full configuration with
preset: '<domain>'. - Machine-readable semantics – export JSON-LD & OWL straight into graph stores.
- Inspector & validator – parse existing IDs, auto-detect presets, and flag compartment-level issues.
- TypeScript-first DX – full typings, builders, and runtime validation helpers.
- Battle-tested – extensive mocha suites, TypeScript compilation tests, and published benchmarks.
Domain Presets & Schema Export
Pick from 30 presets—Person, Contract, Dataset, Device, FinancialAccount, and more—to bootstrap identifiers and schemas instantly.
| Preset | Schema | Subclass of |
|---|---|---|
person |
Person | schema:Person |
individual_customer |
IndividualCustomer | schema:Person |
corporate_customer |
CorporateCustomer | schema:Organization |
employee |
Employee | schema:Person |
supplier |
Supplier | schema:Organization |
partner |
Partner | schema:Organization |
organization |
Organization | schema:Organization |
department |
Department | schema:Organization |
role |
Role | schema:Role |
product |
Product | schema:Product |
product_category |
ProductCategory | schema:CategoryCodeSet |
device |
Device | schema:Product |
asset |
Asset | schema:Product |
inventory_item |
InventoryItem | schema:Product |
contract |
Contract | schema:Contract |
order |
Order | schema:Order |
purchase_order |
PurchaseOrder | schema:Order |
invoice |
Invoice | schema:Invoice |
shipment |
Shipment | schema:ParcelDelivery |
payment_transaction |
PaymentTransaction | schema:PaymentService |
financial_account |
FinancialAccount | schema:FinancialProduct |
budget |
Budget | schema:FinancialProduct |
project |
Project | schema:Project |
task |
Task | schema:Action |
support_case |
SupportCase | schema:Action |
document |
Document | schema:CreativeWork |
policy_document |
PolicyDocument | schema:CreativeWork |
location |
Location | schema:Place |
event |
Event | schema:Event |
dataset |
Dataset | schema:Dataset |
Programmatic benefits
- Single-flag configuration – no repeated separators/compartments across services.
- Metadata-aware tooling – retrieve schema names, descriptions, and inheritance on demand.
- Graph-ready exports – JSON-LD and OWL stay in lockstep with the IDs you generate.
import SemanticIDGenerator, {
getPresetMetadata,
buildSchemaForPreset
} from 'semantic-id-generator';
const generator = new SemanticIDGenerator({ preset: 'contract' });
const id = generator.generateSemanticID('contract');
const metadata = getPresetMetadata('contract');
const { jsonld } = buildSchemaForPreset('contract');
console.log(id);
console.log(metadata.schemaClass); // schema:Contract
console.log(jsonld['sig:entitySchema']); // Contract
👉 Full catalog, metadata APIs, and schema export details live in docs/domain-presets.md.
Semantic ID Inspector
Use SemanticIDInspector when you need to validate or explain IDs that were minted elsewhere (partner systems, historical datasets, governance bots, etc.). The inspector automatically detects domain presets from the data concept prefix, rehydrates the expected configuration, and returns a structured report per compartment.
import SemanticIDGenerator, { SemanticIDInspector } from 'semantic-id-generator';
const generator = new SemanticIDGenerator({ preset: 'contract' });
const inspector = new SemanticIDInspector();
const id = generator.generateSemanticID('contract');
const report = inspector.inspect(id);
console.log(report.isValid); // true
console.log(report.preset); // "contract"
console.log(report.metadata.schemaClass); // "schema:Contract"
const [concept, rest] = id.split('|');
const [first, second, third] = rest.split('-');
const tamperedSecond = `${second.slice(0, 2)}|${second.slice(2)}`;
const tamperedId = `${concept}|${first}-${tamperedSecond}-${third}`;
const failure = inspector.inspect(tamperedId);
console.log(failure.isValid); // false
console.log(failure.compartments[1].issues);
// ["Value contains reserved separator \"|\".", "..."]
- Instantiate the inspector without arguments to rely on preset auto-detection.
- Pass a configuration or preset (either in the constructor or per-call
inspect(id, overrides)) to validate bespoke separators or custom compartment layouts. - Each inspection result surfaces global issues, compartment-level diagnostics, and (when available) the preset metadata so you can feed the findings straight into governance dashboards or ETL quality gates.
👉 Run the dedicated sample in code_samples/semantic-id-inspector/basic-validation to see both a passing and failing inspection end-to-end.
Usage & Configuration
- Advanced configuration patterns (custom compartments, strategies, passphrase languages)
- TypeScript builders, validation helpers, and error handling
- Default values and detailed strategy descriptions
👉 See docs/usage.md for the complete guide, plus the runnable sample in examples/typescript-example.ts.
Testing & Performance
npm testruns the mocha suites, performance checks, and schema snapshot tests.npm run test:typescriptvalidates downstream TypeScript consumption.- Documented throughput numbers for every strategy plus security guarantees around
crypto.
👉 Full details in docs/testing-and-performance.md.
Documentation Map
| Topic | Location |
| — | — |
| Domain presets & schemas | docs/domain-presets.md |
| Usage & configuration recipes | docs/usage.md |
| Testing & performance | docs/testing-and-performance.md |
| TypeScript example app | examples/typescript-example.ts |
| Test-suite overview | test/Test_Suite.md |
License
This project is licensed under the MIT License.
About the Author
Semantic ID Generator is created by Yannick Huchard (CTO).
More information: yannickhuchard.com · Podcast · Medium · YouTube · amase.io
Semantic ID Generator - NPM Package

Table of Contents
Introduction
Semantic ID Generator is a Node.js package designed to generate structured and meaningful unique identifiers, named “Semantic ID”. These identifiers are composed of different “compartments” each having a specific “semantic meaning” and generation strategy.
Latest Updates (v1.3.0):
- ✅ Full-plane Unicode IDs: “All characters” strategy now emits printable code points across the entire Unicode range while guaranteeing the configured compartment length.
- ✅ Safer Passphrases: Word lists are cached once and any word containing your separators is skipped automatically to avoid malformed IDs.
- ✅ Cleaner TypeScript Tooling: The
ts-nodeloader now runs via a dedicated bootstrap script, eliminating experimental warnings in CI. - ✅ Expanded Test Coverage: New suites verify numeric/alphanumeric entropy, Unicode behavior, passphrase caching, and performance thresholds.
- ✅ Domain Presets & Graph Schemas: Ship-ready configurations for
person,organization,device, andfinancial_account, plus JSON-LD and OWL exports so IDs drop straight into Neo4j, Neptune, or any RDF store.
What is a Semantic Identifier?
A Semantic ID is an identifier that implements following AMASE data architecture principles:
- Unique
- Recognizable by humans and artificial intelligence
- Semantically coherent according to semantic rules defined by the data architects/engineers
- Consistent across all data spaces, including historical changes
- Generated from a configurable Factory
A semantic id follows the pattern:
{date concept name}{name separator}{compartment 1}{compartment separator}{compartment 2}{compartment separator}...{compartment N}
Examples
Here a few examples of generated semantic identifiers:
- A person ID:
person|AbCd-12345678-9ABCDEF0123456789ABCDEF0 - An organization ID:
organization|7890-MNO56789-123456789ABCDEF01234 - A multicurrency account:
multicurrency_account|XYZ2-87654321-ABCD5678901234567890 - A device ID:
device_id|A1B2-135792468-EDCBA098765432109876
Domain Presets & Schema Export
To remove repetitive configuration plumbing, the generator now ships 30 core presets that cover the most common data entities. Each preset pairs a ready-to-use generator configuration with metadata and schemas whose names match the entity while inheriting from a well-known vocabulary:
| Preset | Schema | Subclass of | Description |
|---|---|---|---|
person |
Person | schema:Person |
Individual people |
individual_customer |
IndividualCustomer | schema:Person |
Customers who are people |
corporate_customer |
CorporateCustomer | schema:Organization |
Customers that are organizations |
employee |
Employee | schema:Person |
Workforce members |
supplier |
Supplier | schema:Organization |
Vendors or suppliers |
partner |
Partner | schema:Organization |
Strategic/channel partners |
organization |
Organization | schema:Organization |
Generic legal entities |
department |
Department | schema:Organization |
Internal cost centers |
role |
Role | schema:Role |
Functional or security roles |
product |
Product | schema:Product |
Catalog items |
product_category |
ProductCategory | schema:CategoryCodeSet |
Product taxonomies |
device |
Device | schema:Product |
Physical/IoT devices |
asset |
Asset | schema:Product |
Managed assets |
inventory_item |
InventoryItem | schema:Product |
Stock units |
contract |
Contract | schema:Contract |
Legal agreements |
order |
Order | schema:Order |
Customer orders |
purchase_order |
PurchaseOrder | schema:Order |
Procurement POs |
invoice |
Invoice | schema:Invoice |
A/R or A/P invoices |
shipment |
Shipment | schema:ParcelDelivery |
Logistics movements |
payment_transaction |
PaymentTransaction | schema:PaymentService |
Settlements/payments |
financial_account |
FinancialAccount | schema:FinancialProduct |
Accounts, wallets, ledgers |
budget |
Budget | schema:FinancialProduct |
Budget envelopes |
project |
Project | schema:Project |
Initiatives/projects |
task |
Task | schema:Action |
Tasks or work items |
support_case |
SupportCase | schema:Action |
Support/issue cases |
document |
Document | schema:CreativeWork |
Documents/files |
policy_document |
PolicyDocument | schema:CreativeWork |
Policies/standards |
location |
Location | schema:Place |
Physical/logical locations |
event |
Event | schema:Event |
Events or campaigns |
dataset |
Dataset | schema:Dataset |
Analytical/operational datasets |
Programmatic benefits
- Single-flag configuration – pass
preset: '<domain>'once and get the full separator/compartment setup automatically. - Metadata-aware tooling – query
getPresetMetadata()orgetDomainPreset()at runtime to drive CLIs, governance bots, or AI agents. - Graph-ready exports – call
buildSchemaForPreset()/exportSchema()to produce JSON-LD or OWL that mirrors the generated IDs, ready for Neo4j, Neptune, or any RDF store.
import SemanticIDGenerator, {
getPresetMetadata,
buildSchemaForPreset
} from 'semantic-id-generator';
const generator = new SemanticIDGenerator({ preset: 'contract' });
const id = generator.generateSemanticID('contract');
const metadata = getPresetMetadata('contract'); // { schemaName: 'Contract', ... }
const { jsonld } = buildSchemaForPreset('contract');
console.log(id);
console.log(metadata.schemaClass); // schema:Contract
console.log(jsonld['sig:entitySchema']); // Contract
Use the preset field when instantiating the generator:
import SemanticIDGenerator from 'semantic-id-generator';
const generator = new SemanticIDGenerator({ preset: 'person' });
const id = generator.generateSemanticID('person');
console.log(id);
You can also inspect preset definitions programmatically:
import { getDomainPreset, getPresetMetadata, listDomainPresets } from 'semantic-id-generator';
console.log(listDomainPresets()); // ['person', 'individual_customer', ... , 'dataset']
console.log(getDomainPreset('device')); // { dataConceptSeparator: '|', compartmentSeparator: '-', ... }
console.log(getPresetMetadata('device'));
/* {
* key: 'device',
* schemaName: 'Device',
* schemaClass: 'schema:Product',
* description: 'Identifiers for physical or IoT devices.'
* }
*/
Finally, export the accompanying JSON-LD or OWL schemas when you want to feed knowledge graphs:
import { buildSchemaForPreset, exportSchema } from 'semantic-id-generator';
const { jsonld, owl } = buildSchemaForPreset('contract');
console.log(jsonld['sig:entitySchema']); // "Contract"
console.log(jsonld['sig:domainClass']); // "schema:Contract"
const owlXml = exportSchema('contract', 'owl');
// Persist jsonld / owl artifacts or push them to your graph store.
- JSON-LD and OWL files are bundled under
/schema/<preset>.jsonldand/schema/<preset>.owl. - Maintainers can regenerate these artifacts with
npm run build:schema, which runsscripts/build-schemas.mjs.
String Generation Strategies
Semantic ID Generator uses different string generation strategies to generate each compartment of the semantic ID. Here are the currently available strategies:
- all characters: This strategy emits printable characters from the full Unicode plane (including emoji) while respecting the requested compartment length by skipping any code point that would overflow the remaining space.
- visible characters: This strategy generates a string that only includes visible Unicode characters.
- numbers: This strategy generates a string that only includes numeric characters (0-9).
- alphanumeric: This strategy generates a string that includes both alphabetic (A-Z, a-z) and numeric (0-9) characters.
- hexadecimal: This strategy generates a string that includes hexadecimal characters (0-9, a-f).
- base64: This strategy generates a string that includes Base64 characters (A-Z, a-z, 0-9, +, /).
- passphrase: This strategy generates a string using common words from multiple languages (English, French, Spanish, Italian, German, Dutch, Wolof), creating human-readable passphrases and automatically skips any word that contains your configured separators.
These strategies can be assigned to each compartment in the Semantic ID Generator configuration. This allows you to customize the generation of each part of the semantic ID according to your requirements.
Installation
Prerequisites
Before you can use the Semantic ID Generator, you must have certain software installed on your computer:
-
Node.js: This is the JavaScript runtime in which the Semantic ID Generator runs. You can download it from https://nodejs.org.
-
NPM: This is the package manager for Node.js. It is included with the Node.js installation.
After installing Node.js and NPM, you need to install the dependencies of the Semantic ID Generator:
- uuid: This is an NPM package that allows you to generate UUIDs. You can install it with the following command:
npm install uuid
Then install the Semantic ID Generator library
npm install semantic-id-generator
ES Module Support
This package is now a pure ES module. To use it in your project:
For ES Module projects (recommended):
import SemanticIDGenerator from 'semantic-id-generator';
For CommonJS projects:
const SemanticIDGenerator = await import('semantic-id-generator');
const generator = new SemanticIDGenerator.default();
TypeScript Support
The library includes full TypeScript support with comprehensive type definitions. If you’re using TypeScript, you’ll get:
- Full IntelliSense support in your IDE
- Type checking for all configuration options
- Autocomplete for generation strategies and language codes
- Compile-time error detection
import SemanticIDGenerator, {
SemanticIDGeneratorConfig,
Compartment,
GenerationStrategy,
LanguageCode
} from 'semantic-id-generator';
// Type-safe configuration
const config: SemanticIDGeneratorConfig = {
dataConceptSeparator: '|',
compartmentSeparator: '-',
compartments: [
{ name: 'prefix', length: 8, generationStrategy: 'visible characters' },
{ name: 'base64_part', length: 24, generationStrategy: 'base64' },
{ name: 'suffix', length: 12, generationStrategy: 'hexadecimal' }
]
};
const generator = new SemanticIDGenerator(config);
const id = generator.generateSemanticID('document');
Usage
Basic Usage
import SemanticIDGenerator from 'semantic-id-generator';
const generator = new SemanticIDGenerator();
const id = generator.generateSemanticID('person');
console.log(id); // Outputs looks like 'person|abcd-abcdefgh-abcdefghijkl'
Advanced Usage
import SemanticIDGenerator from 'semantic-id-generator';
const config = {
dataConceptSeparator: '|',
compartmentSeparator: '-',
compartments: [
{ name: 'part1', length: 10, generationStrategy: "visible characters"},
{ name: 'part2', length: 10, generationStrategy: "numbers"},
{ name: 'part3', length: 32, generationStrategy: "hexadecimal"}
]
};
const generator = new SemanticIDGenerator(config);
const id = generator.generateSemanticID('person');
console.log(id);
Example with Base64 strategy:
import SemanticIDGenerator from 'semantic-id-generator';
const config = {
dataConceptSeparator: '|',
compartmentSeparator: '-',
compartments: [
{ name: 'prefix', length: 8, generationStrategy: "visible characters"},
{ name: 'base64_part', length: 24, generationStrategy: "base64"},
{ name: 'suffix', length: 12, generationStrategy: "hexadecimal"}
]
};
const generator = new SemanticIDGenerator(config);
const id = generator.generateSemanticID('document');
console.log(id); // Example: 'document|Kj8mNx2-AbCdEfGhIjKlMnOpQrStUv-1a2b3c4d5e6f'
Example with Passphrase strategy:
import SemanticIDGenerator from 'semantic-id-generator';
const config = {
dataConceptSeparator: '|',
compartmentSeparator: '-',
compartments: [
{ name: 'prefix', length: 8, generationStrategy: "visible characters"},
{ name: 'passphrase_part', length: 25, generationStrategy: "passphrase"},
{ name: 'suffix', length: 12, generationStrategy: "hexadecimal"}
]
};
const generator = new SemanticIDGenerator(config);
const id = generator.generateSemanticID('user_session');
console.log(id); // Example: 'user_session|Kj8mNx2-applebananacherrydragon-1a2b3c4d5e6f'
**Example with language configuration:**
```javascript
import SemanticIDGenerator from 'semantic-id-generator';
// Default behavior: uses all languages
const defaultConfig = {
dataConceptSeparator: '|',
compartmentSeparator: '-',
compartments: [
{ name: 'passphrase', length: 25, generationStrategy: "passphrase"}
]
};
const defaultGenerator = new SemanticIDGenerator(defaultConfig);
const mixedId = defaultGenerator.generateSemanticID('session');
console.log(mixedId); // Example: 'session|applepommemanzanaapfel'
// Specific language configuration
const englishConfig = {
dataConceptSeparator: '|',
compartmentSeparator: '-',
languageCode: 'eng',
compartments: [
{ name: 'passphrase', length: 25, generationStrategy: "passphrase"}
]
};
const frenchConfig = {
dataConceptSeparator: '|',
compartmentSeparator: '-',
languageCode: 'fra',
compartments: [
{ name: 'passphrase', length: 25, generationStrategy: "passphrase"}
]
};
const englishGenerator = new SemanticIDGenerator(englishConfig);
const frenchGenerator = new SemanticIDGenerator(frenchConfig);
const englishId = englishGenerator.generateSemanticID('session'); // English only
const frenchId = frenchGenerator.generateSemanticID('session'); // French only
console.log(englishId); // Example: 'session|applebananacherrydragon'
console.log(frenchId); // Example: 'session|pommebananejardinmaison'
Supported Languages:
eng- Englishfra- Frenchspa- Spanishita- Italiandeu- Germannld- Dutchwol- Wolof
Note: By default, the passphrase strategy uses words from all languages. To restrict to a specific language, add languageCode to the configuration.
### TypeScript Examples
For comprehensive TypeScript examples, see the `examples/typescript-example.ts` file. Here are some highlights:
**Type-safe configuration building:**
```typescript
class ConfigurationBuilder {
private config: SemanticIDGeneratorConfig = {};
setDataConceptSeparator(separator: string) {
this.config.dataConceptSeparator = separator;
return this;
}
setCompartmentSeparator(separator: string) {
this.config.compartmentSeparator = separator;
return this;
}
addCompartment(compartment: Compartment) {
if (!this.config.compartments) {
this.config.compartments = [];
}
this.config.compartments.push(compartment);
return this;
}
setLanguageCode(languageCode: LanguageCode) {
this.config.languageCode = languageCode;
return this;
}
build() {
return { ...this.config };
}
}
const config = new ConfigurationBuilder()
.setDataConceptSeparator('|')
.setCompartmentSeparator('-')
.addCompartment({ name: 'prefix', length: 6, generationStrategy: 'visible characters' })
.addCompartment({ name: 'uuid', length: 32, generationStrategy: 'hexadecimal' })
.setLanguageCode('eng')
.build();
Type validation utilities:
function validateStrategy(strategy: GenerationStrategy): boolean {
const allowed: GenerationStrategy[] = [
'all characters',
'visible characters',
'numbers',
'alphanumeric',
'hexadecimal',
'base64',
'passphrase',
];
return allowed.includes(strategy);
}
function validateLanguageCode(code: LanguageCode): boolean {
const allowed: LanguageCode[] = ['eng', 'fra', 'spa', 'ita', 'deu', 'nld', 'wol'];
return allowed.includes(code);
}
// Validate generation strategies
console.log(validateStrategy('base64')); // true
console.log(validateStrategy('invalid' as GenerationStrategy)); // false
// Validate language codes
console.log(validateLanguageCode('eng')); // true
console.log(validateLanguageCode('invalid' as LanguageCode)); // false
Error handling with TypeScript:
try {
const generator = new SemanticIDGenerator();
const id = generator.generateSemanticID('test');
} catch (error) {
// TypeScript knows this is an Error
console.error((error as Error).message);
}
Graph-friendly Schema Export
Each preset includes a machine-readable specification so you can publish identifier semantics to knowledge graphs:
import { buildSchemaForPreset, exportSchema } from 'semantic-id-generator';
const { jsonld, owl } = buildSchemaForPreset('person');
console.log(jsonld['sig:compartments'][0]['schema:name']); // semantic_prefix
const owlString = exportSchema('person', 'owl');
// => RDF/XML string ready for Neo4j, Neptune, Blazegraph, etc.
- JSON-LD and OWL files are bundled under
/schema/<preset>.jsonldand/schema/<preset>.owl. - Contributors can regenerate these artifacts with
npm run build:schema, which runsscripts/build-schemas.mjs.
Default Values
If you do not specify certain configuration options when creating a new Semantic ID Generator, the library uses the following default values:
-
dataConceptSeparator:
|. It separates the data concept name from the rest of the semantic ID. -
compartmentSeparator:
-. It separates the different compartments within the semantic ID. -
compartments: By default, the Semantic ID Generator uses three compartments. The compartments’ names are ‘part1’, ‘part2’, and ‘part3’, and their lengths are 4, 8, and 12 characters, respectively.
-
generationStrategy: “visible characters”. This strategy generates strings using visible Unicode characters (excluding separators).
Performance
The Semantic ID Generator is designed for high performance and security. Here are the performance benchmarks from our test suite:
Performance Benchmarks
General Performance:
- 100,000 IDs generation: ~11 seconds (0.11ms per ID)
- Default configuration: 3 compartments with visible characters strategy
Unicode String Generation:
- 10,000 Unicode strings (10 chars each): ~565ms (0.056ms per string)
- 100 large Unicode strings (1000 chars each): ~700ms (7ms per string)
- 1000 Unicode strings (50 chars each): ~300ms (0.3ms per string)
String Generation Strategies Performance:
- Visible characters: Fastest, uses ASCII range (0x0020-0x007E)
- Numbers: Very fast, uses only digits (0-9)
- Alphanumeric: Fast, uses A-Z, a-z, 0-9
- Hexadecimal: Fast, uses 0-9, a-f
- Base64: Fast, uses A-Z, a-z, 0-9, +, /
- Passphrase: Fast, uses common English words (a-z only)
- All characters: Slower due to Unicode complexity, uses full Unicode range
Security Features
All string generation uses cryptographically secure random number generation:
- Uses Node.js
crypto.randomBytes()andcrypto.randomInt() - Provides high entropy and unpredictability
- Suitable for production environments requiring security
- Latest security updates: All dependencies updated to latest secure versions (August 2025)
Test Coverage
The library includes comprehensive test coverage:
Test Suites:
- 01-semantic-id-generator.test.js: Basic functionality and ID generation
- 02-check-configuration.test.js: Configuration validation and error handling
- 03-check-performances.test.js: Performance benchmarking
- 04-unicode-string-generation.test.js: Unicode string generation optimization
- 05-base64-strategy.test.js: Base64 string generation strategy testing
- 06-passphrase-strategy.test.js: Passphrase string generation strategy testing
- typescript-compilation.test.ts: TypeScript compilation verification
- typescript-definitions.test.ts: TypeScript definitions testing
Test Categories:
- Functionality tests (ID generation, validation)
- Configuration tests (error handling, edge cases)
- Performance tests (speed benchmarks)
- Unicode optimization tests (security and efficiency)
- TypeScript compilation and type checking tests
Total Tests: 48+ passing tests covering all aspects of the library.
Current Dependencies
Production Dependencies:
uuid: ^11.1.0 (Latest secure version)
Development Dependencies:
@types/chai: ^5.2.2@types/mocha: ^10.0.10@types/node: ^24.2.0chai: ^5.2.1mocha: ^11.7.1ts-node: ^10.9.2typescript: ^5.9.2
All dependencies are updated to their latest secure versions as of August 2025.
Run unit tests
Run from your command line interface (Bash, Ksh, Windows Terminal, etc.):
npm test
For TypeScript-specific tests:
npm run test:typescript
License
This project is licensed under MIT License.
About the Author
Semantic ID Generator is created by Yannick Huchard - CTO. For more information, visit:
-
yannickhuchard.com Podcast Medium Youtube - More about AMASE, enterprise engineering/architecture system for businesses and startups: amase.io.