What You’ll Deliver (POC Scope)
- An investment fund ontology (OWL 2 / RDF / Linked Data) modeling the product hierarchy: Fund > ShareClass > Listing > Security, with 73+ identifier types (ISIN, SEDOL, CUSIP, LEI, Bloomberg, and more) governed across US, UK, Ireland, Australia, Canada, and Mexico
- SHACL validation shapes encoding business rules as executable constraints – e.g., "A US‑domiciled ETF listing must have a valid ISIN" – that flag violations at fund launch and throughout the fund lifecycle
- SKOS concept schemes harmonizing vocabulary across 8+ internal systems, providing a semantic bridge between source‑of‑record platforms
- Data integration pipelines mapping relational data into RDF triples (Turtle/N‑Quads) using R2RML, rdflib, or equivalent mapping tooling
- API layer enabling downstream applications to consume graph data programmatically
- Automated governance reports surfacing data quality gaps, missing identifiers, and cross‑system inconsistencies
Qualifications
- 5+ years hands‑on experience building and deploying RDF/OWL ontologies in production (not solely academic – you have shipped something real)
- Experience with SHACL for constraint validation, data quality enforcement, and shape‑based governance
- Fluent in SPARQL 1.1 – complex queries, property paths, and federated queries
- At least one enterprise triple store / graph platform: TopBraid, GraphDB (Ontotext), Stardog, RDFox, Amazon Neptune (RDF mode), or equivalent
- Proficiency in Python or Java for data transformation, pipeline orchestration, and integration work
- Familiarity with financial or securities identifiers (ISIN, SEDOL, CUSIP, LEI, or similar scheme) – you need to understand what an identifier is and why it matters
- Working knowledge of data governance – you understand why constraints exist, not just how to code them
Preferred Experience
- Financial services domain: fund data, securities master, regulatory compliance (MiFID, UCITS, listing rules)
- FIBO (Financial Industry Business Ontology) or similar financial ontologies
- TopBraid EDG / GraphWise (our target platform – evaluation input from this hire is welcome)
- SKOS for vocabulary management and cross‑system term harmonization
- R2RML / RML or equivalent relational‑to‑RDF mapping standards
- Ontology design patterns, modular schema design, ontology versioning (Git‑based workflows)
- Enterprise‑scale data modeling (thousands of entities, multi‑market, multi‑product‑type)
- Exposure to rules engines, inference, or deterministic AI (SHACL‑based reasoning, business rules automation)
- Agile delivery in a regulated environment