SilentEight logo

Data Scientist

SilentEight
3 days ago
Full-time
Remote
Singapore

About Us

We're building AI-native tools that harness the power of large language models (LLMs) to help customers solve high-impact, real-world problems in financial crime compliance. Our platform integrates structured and unstructured data, enabling rapid prototyping, seamless collaboration, and fast iteration with a strong focus on end-to-end delivery.

As we scale, we're looking for a data scientist who thrives at the intersection of research, implementation, and cross-functional collaboration — someone who can own analytical work end-to-end and contribute directly to customer-facing solutions.

What You’ll Do

  • Lead data exploration and analysis on large scale financial crime datasets — including sanctions, PEP (Politically Exposed Persons), and adverse media data — to uncover patterns, identify false positives/negatives, and drive feature improvements.

  • Develop and evaluate agents and rule-based models by running experiments, validating hypotheses, and fine-tuning thresholds to improve alert efficiency.

  • Build and deliver production-ready API integrations — coordinating with software engineers and product teams to ensure components are properly integrated, tested, and merged.

  • Conduct customer-focused data studies across multiple enterprise clients (e.g., financial institutions) to benchmark model performance, assess data quality, and propose data driven solutions to reduce investigation loads.

  • Prototype and iterate quickly — using PySpark, Jupyter notebooks, and Python to explore data, build reproducible pipelines, and generate insights that inform product decisions.

  • Investigate and resolve product issues in collaboration with engineering and product teams.

  • Contribute to R&D on emerging techniques — including graph-based approaches (GNNs, graph embeddings) for transaction monitoring, LLM-based feature exploration, and RAG-based models.

  • Communicate findings clearly through well-organized Jupyter notebooks, internal documentation, and stakeholder presentations, translating complex analytical results into actionable business insights.

What We’re Looking For

  • Based in Singapore or London (remote-first team; flexible working environment).

  • Bachelor's or Master's degree in Data Science, Computer Science, Statistics, or a related field.

  • Minimal 3 years of hands-on experience delivering data science projects, ideally in financial crime compliance, name screening, or AML/KYC domains.

  • Strong proficiency in Python (data manipulation, modelling, pipeline development) and SQL / Spark SQL for large-scale data querying and transformation.

  • Hands-on experience with PySpark or similar distributed data platforms.

  • Familiarity with NLP techniques, and entity resolution concepts.

  • Experience working with LLMs or RAG-based models for information extraction or classification tasks is an advantage.

  • Solid understanding of data quality assessment, including profiling, anomaly identification, and merging logic across complex multi-source datasets.

  • Comfortable working in Git, Docker, Linux, and collaborative development workflows (including code reviews and pull requests).

  • Strong analytical and problem-solving skills — able to investigate ambiguous data issues, form hypotheses, and validate findings rigorously.

  • Good communication skills — able to document findings in a structured and reproducible manner (Jupyter notebooks, Confluence), and present results clearly to both technical and non-technical stakeholders.

  • A mindset of ownership and curiosity: you take initiative, ask the right questions, and follow through to delivery.