Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stream must be replayable to calculate a body hash #1473

Closed
1 task
lon9man opened this issue Nov 22, 2024 · 14 comments
Closed
1 task

Stream must be replayable to calculate a body hash #1473

lon9man opened this issue Nov 22, 2024 · 14 comments
Labels
bug This issue is a bug. documentation This is a problem with documentation. p2 This is a standard priority issue

Comments

@lon9man
Copy link

lon9man commented Nov 22, 2024

Describe the bug

Hello,
trying to upload file to S3 in Jetpack Compose.
Uri for file i got from PhotoPicker.

Regression Issue

  • Select this option if this issue appears to be a regression.

Expected behavior

file uploaded successfully

Current behavior

error: java.lang.IllegalArgumentException: Stream must be replayable to calculate a body hash

Steps to Reproduce

  1. click button "Open Gallery"
@Composable
fun GalleryScreen() {
    var selectedUris by remember {
        mutableStateOf<List<Uri>>(emptyList())
    }

    val multiplePhotoPickerLauncher =
        rememberLauncherForActivityResult(contract = ActivityResultContracts.PickMultipleVisualMedia(),
            onResult = { uris ->
                selectedUris = uris
            })

    fun onOpenGallery() {
        multiplePhotoPickerLauncher.launch(
            PickVisualMediaRequest(ActivityResultContracts.PickVisualMedia.ImageAndVideo)
        )
    }

    Row() {
          Button(onClick = ::onOpenGallery) {
              Text(text = "Open Gallery", color = MaterialTheme.colorScheme.onError)
          }
    }
}
  1. prepare additional parameters (bucket, key, metadata) and run fun upload
private suspend fun upload(bucket: String, key: String, metadata: Map<String, String>, uri: Uri) {
    val inputStream = context.contentResolver.openInputStream(uri)
        ?: throw IllegalArgumentException("Unable to open input stream for URI: ${config.uri}")

    try {
        inputStream.use { stream ->
            S3Client.use { s3 ->
                s3.putObject {
                    bucket = bucket
                    key = key
                    metadata = metadata
                    body = ByteStream.fromInputStream(stream)
                }
            }
        }
    } catch (e: Exception) {
        Log.d("TAG", "$e")
        throw e
    }
}

Possible Solution

No response

Context

No response

AWS SDK for Kotlin version

2.0.21

Platform (JVM/JS/Native)

Native

Operating system and version

Ubuntu 20.04.5 LTS

@lon9man lon9man added bug This issue is a bug. needs-triage This issue or PR still needs to be triaged. labels Nov 22, 2024
@lauzadis lauzadis added documentation This is a problem with documentation. bug This issue is a bug. and removed bug This issue is a bug. needs-triage This issue or PR still needs to be triaged. labels Nov 22, 2024
@lauzadis
Copy link
Member

ByteStream.fromInputStream needs to assume that the given stream is not replayable because any given InputStream may or may not be replayable.

If you have direct access to the file through the uri, you can set the body to: File(uri).asByteStream(), which does not use an InputStream and is correctly marked as replayable.

We can only support replayable bodies for standard SigV4 signing because we need to include the checksum of the body in the request. Calculating the checksum of a non-replayable body would fully consume it, leaving us unable to send it in the request itself.

@lauzadis
Copy link
Member

I've proposed a small fix to infer the replayability of a given InputStream using markSupported(), it won't help you in this case because I believe context.contentResolver.openInputStream returns a FileInputStream which is still not replayable.

smithy-lang/smithy-kotlin#1187

@lauzadis lauzadis added the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 5 days. label Nov 22, 2024
@lauzadis
Copy link
Member

lauzadis commented Nov 22, 2024

We've merged the fix I noted above and it should be available in today's release. Do you have any other questions / concerns?

edit: Fix will be available in Monday's release

@github-actions github-actions bot removed the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 5 days. label Nov 22, 2024
@ianbotsf ianbotsf added the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 5 days. label Nov 22, 2024
@lon9man
Copy link
Author

lon9man commented Nov 25, 2024

@lauzadis

QUESTIONS

I've proposed a small fix to infer the replayability of a given InputStream using markSupported(), it won't help you in this case because I believe context.contentResolver.openInputStream returns a FileInputStream which is still not replayable.

smithy-lang/smithy-kotlin#1187

  1. how this fix should be used? or it is some internal logic, which will be used automatically and will show some error for developer?

If you have direct access to the file through the uri, you can set the body to: File(uri).asByteStream(), which does not use an InputStream and is correctly marked as replayable.

  1. AFAIU uri doesn't have direct access to file. i tried next code to copy into real file. after that it works. BUT maybe exists some better approach to make it without additional copy?
fun fileFromContentUri(context: Context, contentUri: Uri): File {
    val fileExtension = getFileExtension(context, contentUri)
    val fileName = "temporary_file" + if (fileExtension != null) ".$fileExtension" else ""

    val tempFile = File(context.cacheDir, fileName)
    tempFile.createNewFile()

    try {
        val oStream = FileOutputStream(tempFile)
        val inputStream = context.contentResolver.openInputStream(contentUri)

        inputStream?.let {
            copy(inputStream, oStream)
        }

        oStream.flush()
    } catch (e: Exception) {
        e.printStackTrace()
    }

    return tempFile
}

thanks!

@github-actions github-actions bot removed the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 5 days. label Nov 25, 2024
@lauzadis
Copy link
Member

  1. The fix affects internal logic, you don't need to modify your application code to get the change.
  2. You're right, it's very difficult to get a File handle from content URI in Android, I couldn't find anything to suggest. One option is to continue using openInputStream(uri) and then wrap that in a BufferedInputStream which is now supported after that fix I linked earlier.

@lauzadis lauzadis added the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 5 days. label Nov 26, 2024
@lon9man
Copy link
Author

lon9man commented Nov 26, 2024

@lauzadis

One option is to continue using openInputStream(uri) and then wrap that in a BufferedInputStream which is now supported after that fix I linked earlier.

hmm.. i installed aws-sdk-kotlin 1.3.82. using your quote above changed code to:

private suspend fun upload(config: UploadConfiguration) {
        val inputStream = context.contentResolver.openInputStream(config.uri)
            ?: throw IllegalArgumentException("Unable to open input stream for URI: ${config.uri}")

        //var tempFile: File? = null

        try {
            //tempFile = fileFromContentUri(context, config.uri)

            BufferedInputStream(inputStream).use { stream ->
                s3Client.get().putObject {
                    bucket = config.bucket
                    key = config.key
                    metadata = config.meta.getUserMetadata().mapKeys { it.key.code }
                    body = ByteStream.fromInputStream(stream)
                    //body = ByteStream.fromFile(tempFile)
                }
            }
        } catch (e: Exception) {
            throw e
        } finally {
            //if (tempFile != null && tempFile.exists()) tempFile.delete()
        }
}

got new error message (without replayable words) in catch:
A header you provided implies functionality that is not implemented, Request ID: V8MVPD3NVC71C71X, Extended request ID: kUD3E3jwhbqNKuKg20dkzOqfPU5I9TubxopNiCi7w6gbMQZVXY7H1/NCYe8z04da3sP62QOsQwBdgwTljbu89UfwWEaQLrAT

am i doing something wrong?

@github-actions github-actions bot removed the response-requested Waiting on additional info and feedback. Will move to 'closing-soon' in 5 days. label Nov 26, 2024
@lauzadis
Copy link
Member

Are you using a 3rd party object storage like MinIO? That's the only time we've seen errors similar to this. Can you send a full request wire log by configuring LogMode.LogRequestWithBody on your client?

@lauzadis
Copy link
Member

I was able to replicate your error, I don't think BufferedInputStream was the right recommendation. I'm taking a look to see if there's anything better for you to use

@lon9man
Copy link
Author

lon9man commented Nov 27, 2024

@lauzadis

I'm taking a look to see if there's anything better for you to use

thanks for the help.

Are you using a 3rd party object storage like MinIO

no

next is FYI, because behavior is strange)

Can you send a full request wire log by configuring LogMode.LogRequestWithBody on your client

ohh) it was magic trip. i spent several hours to enable logging.
this docs doesn't work for API < 26.
for API 26 it also needs to make adjustments for packaging.resources.excludes (i didn't finish, maybe something else also).
maybe it should be some warnings in docs about possible challenges?

then i installed logback. i ran sources like in previous post.

so my results of wire log. also interesting)

  1. log is enabled
I  18:13:06.791 [DefaultDispatcher-worker-3] DEBUG aws.smithy.kotlin.runtime.http.operation.AuthHandler -rpc="S3.PutObject" sdkInvocationId="60bde336-37f1-421a-bfa1-1869fd252a41"- resolved endpoint: Endpoint(uri=https://s3.us-east-1.amazonaws.com/upload.appname.com, headers=null, attributes={AttributeKey(aws.smithy.kotlin#endpointAuthSchemes)=[AuthOptionImpl(schemeId=AuthSchemeId(id=aws.auth#sigv4), attributes={AttributeKey(aws.smithy.kotlin.signing#AwsSigningRegion)=us-east-1, AttributeKey(aws.smithy.kotlin.signing#AwsSigningService)=s3, AttributeKey(aws.smithy.kotlin.signing#UseDoubleUriEncode)=false})]})
I  18:13:06.794 [DefaultDispatcher-worker-3] DEBUG aws.smithy.kotlin.runtime.http.interceptors.FlexibleChecksumsRequestInterceptor -rpc="S3.PutObject" sdkInvocationId="60bde336-37f1-421a-bfa1-1869fd252a41"- no checksum algorithm specified, skipping flexible checksums processing
I  18:13:06.810 [DefaultDispatcher-worker-3] DEBUG aws.smithy.kotlin.runtime.auth.awssigning.DefaultAwsSignerImpl -rpc="S3.PutObject" sdkInvocationId="60bde336-37f1-421a-bfa1-1869fd252a41"- Calculated signature: 6a35daec350518be878dd88e2e784c9374f1767d5ff6617c83b43e6f3c091aab
I  18:13:06.820 [DefaultDispatcher-worker-3] DEBUG aws.smithy.kotlin.runtime.awsprotocol.ClockSkewInterceptor -rpc="S3.PutObject" sdkInvocationId="60bde336-37f1-421a-bfa1-1869fd252a41"- service did not return "Date" header, skipping skew calculation
I  18:13:06.824 [DefaultDispatcher-worker-3] DEBUG aws.smithy.kotlin.runtime.http.middleware.RetryMiddleware -rpc="S3.PutObject" sdkInvocationId="60bde336-37f1-421a-bfa1-1869fd252a41"- request failed with non-retryable error
D  UploadWorker: java.io.IOException: Stream closed
  1. log is disabled
UploadWorker: aws.sdk.kotlin.services.s3.model.S3Exception: A header you provided implies functionality that is not implemented, Request ID: XNAPR3KWRS6160A5, Extended request ID: dFUeKVH31om2BMX0NAakcZu71j5bXaAawp67I2tXvXiTYBnGMZNkYD9JxG1S0cTCU7QgHpHPGjM=

but we can't see SDK logs, because they are disabled))

@RanVaknin RanVaknin added the p2 This is a standard priority issue label Nov 27, 2024
@lon9man
Copy link
Author

lon9man commented Dec 10, 2024

Hello @lauzadis,

I'm taking a look to see if there's anything better for you to use

did you get some results?

@LibertyPaul
Copy link

LibertyPaul commented Dec 23, 2024

I'm observing a related issue when trying to make a general solution for streams and buffers.

If I run the below code - I get aws.smithy.kotlin.runtime.http.HttpException: java.net.ProtocolException: unexpected end of stream error before the request is sent to S3.

❌ Problematic example:

@Test
fun minimalFailingTest(): Unit = runBlocking {
    val data = "Hello world!".toByteArray()
    val request = PutObjectRequest {
        bucket = BUCKET_NAME // Already created
        key = "minimal-test.txt"
        body = data.inputStream().asByteStream(data.size.toLong())
    }

    s3Client.putObject(request)
}

Also tried body = ByteStream.fromInputStream(data.inputStream(), data.size.toLong()) -- with the same outcome.
Obviously, the underlying stream, ByteArrayInputStream is replayable.

If I pass the body as ByteStream.fromBytes(data) - everything works.
✅ Successful example:

@Test
fun minimalSuccessfulTest(): Unit = runBlocking {
    val data = "Hello world!".toByteArray()
    val request = PutObjectRequest {
        bucket = BUCKET_NAME // Already created
        key = "minimal-test.txt"
        body = ByteStream.fromBytes(data)
    }

    s3Client.putObject(request)
}

I use the latest SDK version at the moment: 1.3.99 (with Kotlin 2.0.20), and I see that the version SDK already has the fix with override val isOneShot: Boolean = !markSupported() (can see this line in the source code).

Any idea why this could happen?
Please let me know if it needs to be put as a separate issue.

@ianbotsf
Copy link
Contributor

ianbotsf commented Jan 3, 2025

@lon9man We've made some improvements to our stream wrapping to better handle replayable streams (such as file streams or Android content streams). These changes are staged for the release after today's release (v1.3.106, tentatively scheduled for Monday).

Note that S3 PutObject requires an explicit content length, which you will need to provide when wrapping the stream:

context.contentResolver.openAssetFileDescriptor(uri, "r").use { file ->
    file.createInputStream().use { stream ->
        s3.putObject {
            bucket = bucket
            key = key
            metadata = metadata
            body = ByteStream.fromInputStream(stream, file.length)
        }
    }
}

@LibertyPaul I've reproduced the error and confirmed that the upcoming release will resolve your issue too due to the improved stream wrapping mentioned above.

@ianbotsf
Copy link
Contributor

ianbotsf commented Jan 7, 2025

SDK v1.3.106 has been released with the improvements to InputStream wrapping so I'm resolving this issue. Please reopen or file a new issue if you're still encountering issues related to InputStream.

@ianbotsf ianbotsf closed this as completed Jan 7, 2025
Copy link

github-actions bot commented Jan 7, 2025

⚠️COMMENT VISIBILITY WARNING⚠️

Comments on closed issues are hard for our team to see.
If you need more assistance, please either tag a team member or open a new issue that references this one.
If you wish to keep having a conversation with other community members under this issue feel free to do so.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug This issue is a bug. documentation This is a problem with documentation. p2 This is a standard priority issue
Projects
None yet
Development

No branches or pull requests

5 participants