Add faster, architecture dependent memcpy()