AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

SEhVVmtsaUt6RndWMUZhdW9VMDBOSVpJWGc9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Fransisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.

Job Tags

Similar Jobs

Liberty Health

HOSPICE AIDE - CNA - WEEKEND Job at Liberty Health

 ...HOSPICE AIDE - CNA - WEEKEND Roanoke Rapids-NC-27870-United States Liberty Cares With Compassion At Liberty Hospice we understand the unique needs of our patients and families facing terminal illness. That is why Liberty Hospice provides our... 

CULTIV8, Inc

Marketing Event Coordinator Job at CULTIV8, Inc

 ...We are not your average marketing event company. Located downtown Cincinnati, we aim to increase market penetration and rebuild customer relationships for over 60 Fortune 500 Clients with face-to-face communication. Cincinnati, with its central location and developing... 

Belcan

Aircraft Inspector III Job at Belcan

Job Title: Aircraft Inspector III Location: Hagerstown, MD Zip Code: 21742 Duration: 12 months Job responsibilities: As an Aircraft Inspector III, you will be using your skills and expertise to inspect our aircraft, aircraft systems, equipment, and parts ...

Seneca Resources

Call Center Customer Service Representative Job at Seneca Resources

 ...performance benchmarks. Required Skills Ability to multitask efficiently. Basic proficiency with Microsoft Word, Excel, and Google Workspace. Strong basic math skills (add, subtract, multiply, divide). Excellent verbal and written communication. Strong... 

Travelers Insurance Company

Outside Auto Appraiser (Long Beach) Job at Travelers Insurance Company

 ...+ Prepares and documents accurate vehicle / equipment damage appraisals, Actual Cash and Replacement values according to applicable regulatory...  ...manages work assignments and tracks savings and referrals.+ Reviews and analyzes coverage and apply policy conditions, provisions,...