Advisory Engineer, Hardware HPSC
Ricoh Americas Corp.
Posted Friday, August 30, 2024
Posting ID: 300006327780762_crt:1725019552776
Boulder, CO
POSITION PROFILE
Develops plans for new or enhanced products, technologies, or processes consistent with division product plans and strategies, resulting in new and original products, devices, components, equipment, scientific applications, or improved production processes.
Provides technical guidance to other professionals and/or acts as team leader on special projects. Maintains current knowledge of the marketplace, key competitors and the business/technological environment.
Works with limited direction and general instruction on new assignments. Work is evaluated upon completion.
Job Title: Control Unit Hardware Engineer
Job Description:
As a Control Unit Hardware Engineer, you will play a crucial role in designing, building, and maintaining high-performance computing rack clusters. Your responsibilities will span hardware and software development, installation automation, and collaboration with cross-functional teams. Key duties include:
- Design and Development:
  - Collaborate with colleagues to build and certify high-performance computing rack clusters.
  - Design, develop, and maintain Linux kernel modules and extensions.
  - Troubleshoot and resolve complex technical issues related to the Linux operating system to meet SLAs.
  - Construct racks for the test team and set up racks for EMC certification.
- Software and Tools:
  - Develop and maintain a suite of Linux-based tools for server inventory, firmware updates, and hardware monitoring.
  - Write Python and Bash scripts to communicate with server management processors in openSUSE environments.
  - Optimize system resource utilization, including memory management and scheduling.
  - Deploy custom Linux images to automate server firmware distribution and validation at scale for both internal and external customers.
- Driver Development and Testing:
  - Build PCIe Ethernet adapter drivers from source for various Linux distributions.
  - Test these drivers using virtual machines and Docker images.
- Collaboration and Support:
  - Work closely with the Basic Operating System (BOS) team to ensure server BIOS configurations align with BOS requirements and microcode.
  - Provide the BOS team with accurate hardware documentation and configurations.
  - Manage the Redmine issue-tracking system to troubleshoot and document hardware failures, including CPUs, memory DIMMs, system boards, power supplies, HDDs/SSDs, and PCIe adapters.
  - Perform frequent firmware updates and BOS installations.
- Technology Evaluation and Decision-Making:
  - Evaluate the latest server technologies, including Intel 100Gb NICs, AMD CPUs and GPUs, and Dell RACADM.
  - Collaborate with code development, quality assurance, and product engineering teams to define product requirements, testing, release, and field support.
- Communication and Continuity:
  - Provide other teams with written instructions on firmware updates and hardware usage in the field.
  - Participate in decisions related to supplier concerns, end-of-life buys, and capital planning to ensure product continuity while remaining cost-effective.
  - Engage directly with third-party manufacturing teams to resolve release issues and validate every server rack shipped.
Qualifications:
- Bachelor's degree in Computer Science, Electrical Engineering, or a related field.
- Proficiency in Linux, C, Python, and Bash scripting.
- Strong understanding of Linux operating system internals (10+ years).
- Experience with Linux kernel extension development.
- Familiarity with Linux boot process and initialization.
- In-depth knowledge of memory management within the Linux kernel.
- Expertise in multi-threaded programming, resource locking, and handling race conditions.
- Experience with server hardware, firmware, and networking.
- Strong problem-solving skills and attention to detail.
- Excellent communication and collaboration abilities.
The above statements are intended to describe the general nature and level of work being performed by people assigned to this job. They are not intended to be an exhaustive list of all responsibilities, skills, efforts or working conditions associated with a job.
The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information, or any characteristic protected by law.