Live Cross-Cloud Migrations With Aeolus and Snap

List overview All Threads
Download

newer

older

Cloud SIG meeting today

Fwd: Updated Cloud Guide draft...

Mo Morsi

3 Nov 2011 3 Nov '11

6:17 p.m.

See the following step-by-step guide (complete w/ screenshots ) [1] that I threw together on how to use Aeolus and Snap [2] to migrate a running instance between two separate cloud providers with no downtime (demonstrated is EC2 / rackspace but this will work to/from RHEV-M or any other cloud provider supported by aeolus/deltacloud [3])

-Mo

[1] http://mo.morsi.org/blog/node/347

[2] https://github.com/movitto/snap

[3] http://incubator.apache.org/deltacloud/drivers.html#h2

Attachments:

attachment.html (text/html — 1.2 KB)

Show replies by date

Richard W.M. Jones

4 Nov 4 Nov

2:59 a.m.

On Thu, Nov 03, 2011 at 09:17:55PM -0400, Mo Morsi wrote:

...

See the following step-by-step guide (complete w/ screenshots ) [1] that I threw together on how to use Aeolus and Snap [2] to migrate a running instance between two separate cloud providers with no downtime (demonstrated is EC2 / rackspace but this will work to/from RHEV-M or any other cloud provider supported by aeolus/deltacloud [3])

-Mo

[1] http://mo.morsi.org/blog/node/347

[2] https://github.com/movitto/snap

[3] http://incubator.apache.org/deltacloud/drivers.html#h2

You mention in the blog posting that you can restore to another Linux distro. But would this really work? How about if there were two different Apache versions, each with a slightly different configuration syntax? Or more plausibly, two different PostgreSQL versions (PG's on-disk format is not compatible across some version changes).

Rich.

-- Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones New in Fedora 11: Fedora Windows cross-compiler. Compile Windows programs, test, and build Windows installers. Over 70 libraries supprt'd http://fedoraproject.org/wiki/MinGW http://www.annexia.org/fedora_mingw

Richard W.M. Jones

5:36 a.m.

On Fri, Nov 04, 2011 at 09:59:39AM +0000, Richard W.M. Jones wrote:

...

On Thu, Nov 03, 2011 at 09:17:55PM -0400, Mo Morsi wrote:

...
See the following step-by-step guide (complete w/ screenshots ) [1] that I threw together on how to use Aeolus and Snap [2] to migrate a running instance between two separate cloud providers with no downtime (demonstrated is EC2 / rackspace but this will work to/from RHEV-M or any other cloud provider supported by aeolus/deltacloud [3])

-Mo

[1] http://mo.morsi.org/blog/node/347

[2] https://github.com/movitto/snap

[3] http://incubator.apache.org/deltacloud/drivers.html#h2

You mention in the blog posting that you can restore to another Linux distro. But would this really work? How about if there were two different Apache versions, each with a slightly different configuration syntax? Or more plausibly, two different PostgreSQL versions (PG's on-disk format is not compatible across some version changes).

Some follow-up questions ..

How large is the snap metadata, ie. the stuff that you copy between the machines? How large would it be given, say, a typical database-backed webserver installation where you might have lots of static contents and some database tables?

Is the metadata in an ad-hoc format and how hard would it be to turn it into a standard format (probably one that we would standardize ourselves)? Can it be useful in other contexts -- eg. could a system administrator look at the output in order to get a definitive list of the changes made to the machine? Could it be useful for auditing? Could the format be diffed?

Rich.

-- Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones virt-df lists disk usage of guests without needing to install any software inside the virtual machine. Supports Linux and Windows. http://et.redhat.com/~rjones/virt-df/

Mo Morsi

5:51 a.m.

On 11/04/2011 08:36 AM, Richard W.M. Jones wrote:

...

On Fri, Nov 04, 2011 at 09:59:39AM +0000, Richard W.M. Jones wrote:

...
On Thu, Nov 03, 2011 at 09:17:55PM -0400, Mo Morsi wrote:

...
See the following step-by-step guide (complete w/ screenshots ) [1] that I threw together on how to use Aeolus and Snap [2] to migrate a running instance between two separate cloud providers with no downtime (demonstrated is EC2 / rackspace but this will work to/from RHEV-M or any other cloud provider supported by aeolus/deltacloud [3])

-Mo

[1] http://mo.morsi.org/blog/node/347

[2] https://github.com/movitto/snap

[3] http://incubator.apache.org/deltacloud/drivers.html#h2

You mention in the blog posting that you can restore to another Linux distro. But would this really work? How about if there were two different Apache versions, each with a slightly different configuration syntax? Or more plausibly, two different PostgreSQL versions (PG's on-disk format is not compatible across some version changes).

Some follow-up questions ..

How large is the snap metadata, ie. the stuff that you copy between the machines? How large would it be given, say, a typical database-backed webserver installation where you might have lots of static contents and some database tables?

One of the nice things that I added to Snap was the ability to ignore static content managed by the package management system. For example when taking a snapshot of the filesystem, only the files modified post installation and the files not tracked by the package system will be backed up and restored.

It should be simple enough to expand upon this concept, adding additional hooks to call out to to determine what exactly should be backed up and restore (hooks to be invoked during the backup / restoration process is already a feature on the project todo list / backlog).

...

Is the metadata in an ad-hoc format and how hard would it be to turn it into a standard format (probably one that we would standardize ourselves)? Can it be useful in other contexts -- eg. could a system administrator look at the output in order to get a definitive list of the changes made to the machine? Could it be useful for auditing? Could the format be diffed?

Right now the snapshot is a simple tarball containing the actual contents of the snapshot and the metadata in XML files. So for example there is a packages.xml file which contains the packages which have been recorded, services.xml containing the services and associated metadata, etc. We can use this as the basis of the standard, easily encapsulating any required information there.

-Mo

Richard W.M. Jones

6:46 a.m.

On Fri, Nov 04, 2011 at 08:51:08AM -0400, Mo Morsi wrote:

...

On 11/04/2011 08:36 AM, Richard W.M. Jones wrote:

...
How large is the snap metadata, ie. the stuff that you copy between the machines? How large would it be given, say, a typical database-backed webserver installation where you might have lots of static contents and some database tables?

One of the nice things that I added to Snap was the ability to ignore static content managed by the package management system. For example when taking a snapshot of the filesystem, only the files modified post installation and the files not tracked by the package system will be backed up and restored.

It should be simple enough to expand upon this concept, adding additional hooks to call out to to determine what exactly should be backed up and restore (hooks to be invoked during the backup / restoration process is already a feature on the project todo list / backlog).

...
Is the metadata in an ad-hoc format and how hard would it be to turn it into a standard format (probably one that we would standardize ourselves)? Can it be useful in other contexts -- eg. could a system administrator look at the output in order to get a definitive list of the changes made to the machine? Could it be useful for auditing? Could the format be diffed?

Right now the snapshot is a simple tarball containing the actual contents of the snapshot and the metadata in XML files. So for example there is a packages.xml file which contains the packages which have been recorded, services.xml containing the services and associated metadata, etc. We can use this as the basis of the standard, easily encapsulating any required information there.

Can you give us some numbers -- how big was the tarball for the migration you did?

If we had a more formal description, then it could be the basis for a useful collection of tools.

snap

puppet manifest snap formal --------> spec for --------> sysadmin libguestfs- VM based tool auditing | ^ | | +------+ p2v, v2v

I think your demonstration only worked with a bit of luck. For v2v we rewrite a lot of configuration files, install virtio drivers etc. In terms of a formalized snap description, that process is a kind of transformation.

How (if at all) does this apply to Windows?

Rich.

-- Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones virt-p2v converts physical machines to virtual machines. Boot with a live CD or over the network (PXE) and turn machines into Xen guests. http://et.redhat.com/~rjones/virt-p2v

Mo Morsi

7:14 a.m.

On 11/04/2011 09:46 AM, Richard W.M. Jones wrote:

...

On Fri, Nov 04, 2011 at 08:51:08AM -0400, Mo Morsi wrote:

...
On 11/04/2011 08:36 AM, Richard W.M. Jones wrote:

...
How large is the snap metadata, ie. the stuff that you copy between the machines? How large would it be given, say, a typical database-backed webserver installation where you might have lots of static contents and some database tables?

One of the nice things that I added to Snap was the ability to ignore static content managed by the package management system. For example when taking a snapshot of the filesystem, only the files modified post installation and the files not tracked by the package system will be backed up and restored.

It should be simple enough to expand upon this concept, adding additional hooks to call out to to determine what exactly should be backed up and restore (hooks to be invoked during the backup / restoration process is already a feature on the project todo list / backlog).

...
Is the metadata in an ad-hoc format and how hard would it be to turn it into a standard format (probably one that we would standardize ourselves)? Can it be useful in other contexts -- eg. could a system administrator look at the output in order to get a definitive list of the changes made to the machine? Could it be useful for auditing? Could the format be diffed?

Right now the snapshot is a simple tarball containing the actual contents of the snapshot and the metadata in XML files. So for example there is a packages.xml file which contains the packages which have been recorded, services.xml containing the services and associated metadata, etc. We can use this as the basis of the standard, easily encapsulating any required information there.

Can you give us some numbers -- how big was the tarball for the migration you did?

2.7MB

This included the mediawiki db dump which was close to 1MB, and inspecting the snapshot I realize that it can be optimized further to reduce a bit of unnecessary cruft.

...

If we had a more formal description, then it could be the basis for a useful collection of tools.
                                          snap

                                          puppet manifest
snap               formal
      -------->     spec for    -------->   sysadmin
libguestfs- VM based tool auditing | ^ | | +------+ p2v, v2v

I think your demonstration only worked with a bit of luck. For v2v we rewrite a lot of configuration files, install virtio drivers etc. In terms of a formalized snap description, that process is a kind of transformation.

Sure thing, the demo was a proof of concept, though quite a bit of refactoring went into making the tool more modular and pluggable so that it can be easily extended and adapted to meet whatever snapshot and restoration needs.

Agree that a formal metadata and api definitions would be very useful to have, added that as a high priority item to the TODO list.

...

How (if at all) does this apply to Windows?

So yes I have been throwing around different ideas of how to accomplish different aspects of the backup/restoration process on Windows

- The repositories bit obviously does not apply, so that snapshot target can be ignored on those systems.

- Packages would only apply in a very limited manner, eg we can record what software is installed and what versions, but even then it would require more end user interaction / intervention in the restoration process.

- Backing up files should be straightforward, albiet without the support of a package system to determine which files are redundant. Snap already supports the user selection of which files to backup / restore and we can combine this with fixed lists of files to ignore for different windows versions

- Services should be simple enough and will work in the same manner as on Linux, eg for windows we know how to backup a postgres db, a running IIS webserver, or whatever other services.

So all in all, snap will be able to work in a consistent manner across linux distros, Windows, and even Mac OSX!

-Mo

Mo Morsi

5:44 a.m.

On 11/04/2011 05:59 AM, Richard W.M. Jones wrote:

...

On Thu, Nov 03, 2011 at 09:17:55PM -0400, Mo Morsi wrote:

...
See the following step-by-step guide (complete w/ screenshots ) [1] that I threw together on how to use Aeolus and Snap [2] to migrate a running instance between two separate cloud providers with no downtime (demonstrated is EC2 / rackspace but this will work to/from RHEV-M or any other cloud provider supported by aeolus/deltacloud [3])

-Mo

[1] http://mo.morsi.org/blog/node/347

[2] https://github.com/movitto/snap

[3] http://incubator.apache.org/deltacloud/drivers.html#h2

You mention in the blog posting that you can restore to another Linux distro. But would this really work? How about if there were two different Apache versions, each with a slightly different configuration syntax? Or more plausibly, two different PostgreSQL versions (PG's on-disk format is not compatible across some version changes).

Rich.

Ah ya, I mentioned in the post that it was theoretically possible but I hadn't tried it out yet. Obviously the implementation details between distros and operating systems would need to be ironed out, but Snap already has mechanisms to make these distinctions and has support for abstract snapshot targets with os-dependent backends tightly coupled to the implementation details of those systems. We would just need to expand upon those, encapsulating package and service metadata in the snapshots.

-Mo

Mo Morsi

5:23 a.m.

On 11/03/2011 09:17 PM, Mo Morsi wrote:

...

See the following step-by-step guide (complete w/ screenshots ) [1] that I threw together on how to use Aeolus and Snap [2] to migrate a running instance between two separate cloud providers with no downtime (demonstrated is EC2 / rackspace but this will work to/from RHEV-M or any other cloud provider supported by aeolus/deltacloud [3])

-Mo

[1] http://mo.morsi.org/blog/node/347

[2] https://github.com/movitto/snap

[3] http://incubator.apache.org/deltacloud/drivers.html#h2

It looks like my site is experiencing some downtime (perfect timing :-/) so I replicated the content on a hosted wordpress account. Its not formatted as nicely but the content is there

http://mrmorsi.wordpress.com/

-Mo

4621

Age (days ago)

4621

Last active (days ago)

cloud@lists.fedoraproject.org

7 comments

2 participants

tags (0)

participants (2)

Mo Morsi
Richard W.M. Jones