DevOps Rescue Story: Recovering an EC2 Instance Without a PEM Key
Mojahid Ul Haque
DevOps Engineer
"Lost PEM? No SSH? SSM dead? Don't panic — AWS always leaves a backdoor for those who know where to look."
Yesterday I ran into one of those heart-sinking moments: an EC2 instance was completely locked out. - PEM key gone → SSH impossible - SSM agent broken → root volume full, wouldn't start even after EBS expansion - EC2 Instance Connect failing
Basically… the instance was bricked. Or so it seemed.
The Recovery Playbook I Followed
- Spun up a helper EC2 instance with a fresh key pair.
- Detached the root volume from the locked instance → attached it to the helper.
- Mounted the volume → discovered the partition still capped at 100GB even though the EBS size was already 150GB.
- Ran growpart + resize2fs → filesystem finally stretched to the full 150GB. (49GB free instantly.)
- Cleared old logs and temp files for breathing room.
- Added a new SSH public key into ~/.ssh/authorized_keys.
- Detached the fixed root volume → reattached it back to the original instance.
- Rebooted → boom! SSH worked with the new PEM, and the SSM Agent sprang back to life.
Originally posted on LinkedIn
View original postRelated Posts
How I reduced AWS networking costs by 93% while removing public attack surface
I recently tackled a common but expensive challenge in AWS: the hidden cost of public IPv4 addresses. In a setup with dozens of ECS Fargate tasks, my "In-use Public IP" charges...
AWS ECS Mumbai has mood swings - DevOps engineer perspective
As a DevOps engineer, I've basically accepted that AWS ECS Mumbai has mood swings. Once or twice a month, it just… decides it's done with life. Deploy? Maybe. Pull images? If it...
Stop Leaving AWS Credits Unclaimed - That Outage Might've Owed You Money
Remember the AWS outage on October 20th? Six hours down. Over 100+ services affected. Millions of users impacted. Everyone's talked about the RC multi-region setups, and resilie...