Learning from protein data with protein language models