logo

View all jobs

Site Reliability Engineer

New York, NY
Our client, a boutique US HedgeFund, is seeking an SRE who will sit at the center of trading operations and infrastructure to ensure the automation and delivery of productive and efficient trading systems. This is a newly created position which will provide autonomy to identify gaps, improve the whole life-cycle of services-from inception and design, through deployment, operation and refinement of our trading systems. The ideal candidate will demonstrate ownership of the Devops processes through a systematic problem-solving approach and a desire to build robust, cutting edge and scalable systems.

As an SRE, you will:

  • Have a deep understanding of the trading workflow, ensuring its effectiveness across teams.
  • Assist in automating the release and deployment of software.
  • Streamline the build process.
  • Work in cluster environments.
  • Monitor and manage trading workflow, research and trading.
  • Build, streamline, organize and design framework.

Requirements

  • 5+ years of experience in a DevOps/Cluster Engineer role.
  • Bachelor’s Degree in Computer Science.
  • Knowledge of the Linux operating system, permissions, NFS.
  • Effective verbal and written communication skills.
  • Experience managing entire pipelines, working with tools such as Jenkins, Airflow and Ansible.
  • Highly productive in python, bash.
  • Experience in distributed computing/ parallized computing, cluster running computational jobs, developing system in this area.
  • Experience in data recording, storage, and maintenance (backups, redundancy, compression and archiving).
  • Knowledge of CMake.
  • Knowledge of the continuous integration and deployment of code as well as the development toolchain.
  • Hardware and software expertise.
  • Strong problem solving aptitude.

Additional skills/experience that will reflect favorably

  • Experience installing/configuring the ELK stack, including logstash input/output plugins.
  • Experience building Grafana/Kibana dashboards.
  • Familiarity with jupyter notebooks and Docker.
  • Trading operations experience, build environment experience, development experience in python and C++ preferred.
  • Production Environment Monitoring and Maintenance (diagnostics and troubleshooting live trading)
  • Cloud (aws/gcp/azure) and Distributed Computing (slurm, spark, etc)

 
Thank you for illuminating hiring with Quanta Search!

www.quantasearch.com
 

Share This Job

Powered by