
This full-time AI Performance Optimization Engineer role is based within Bright Vision Technologies' in-house engineering team and operates 100% remotely across the United States. The position focuses on maximizing throughput, minimizing latency, and reducing costs for large-scale neural network training and inference systems. Key responsibilities include profiling and optimizing end-to-end AI pipelines, implementing model compression strategies like quantization and pruning, and driving compiler-level improvements using tools such as Triton and XLA. The role appeals to candidates seeking deep technical impact in a collaborative environment where they can mentor junior engineers and shape production best practices. The company offers a direct W2 employment structure with long-term project alignment and a culture that values rigorous data-driven engineering and continuous innovation in AI systems.




















