I started using Floyd Hub a few days ago and I'm really loving the convenience compared to spinning up EC2 instances.
But one feature I think would improve the platform is being able to access AWS s3 bucket data and Google Cloud buckets. This would be great for very large datasets that might otherwise take days to download and upload on a slow connection.
This is how I imagine it would work from a users perspective:
1. Create new dataset
2. Select dataset source (options might be s3, Google Cloud Bucket, etc)
3. Enter bucket URL and credentials. (This might create a link from the dataset to the bucket or copy it)
4. Read data from bucket in Python, Jupyter etc..