Your key responsibilities:
Participate in managing the infrastructure that runs soundcloud.com to help provide very high uptime and performance in a 24×7 environment.
Manage and lead systems deployment and upgrade efforts of varying size and complexity.
Drive improvements to our automation of system provisioning and hardware management.
Participate in designing and building a wide variety of tools to help support the infrastructure.
Perform extensive failure and recovery analysis, and use the findings to create tools and documentation.
Participate in on-call rotation.
Required skills and experience:
4+ years of professional Linux system administration in large data center environments serving a high-traffic destination site.
Experience with configuration management systems such as Chef or Puppet.
Experience with all aspects of automating hardware provisioning.
Great shell programming skills.
Experience with network engineering and strong knowledge of IPv4 and IPv6 networks.
Proven excellent communication skills, both verbal and written.
Experience with Varnish, nginx, HAProxy, and memcached.
Experience using Git.
Experience with Amazon Web Services.
Experience managing database/datastore clusters, such as MySQL, Cassandra, and Hadoop.
Experience with Ruby and preferably at least one more language.
Experience with monitoring systems like Graphite, Ganglia, Nagios, New Relic, Hoptoad.
Experience with CDNs and network caching technologies.
SoundCloud is the world’s leading social sound platform, with more than 12 hours of music and audio uploaded every minute and a reach of more than 250 million people. SoundCloud is the online place for discovering compelling and engaging music and audio.