Skip to content
This repository was archived by the owner on Jul 19, 2023. It is now read-only.
This repository was archived by the owner on Jul 19, 2023. It is now read-only.

Skips large blocks of events during import from CloudWatch? #74

@danielmcquillen

Description

@danielmcquillen

I'm using this (excellent, thanks @lukewaite ) plugin to move a filtered subset of events from one CloudWatch log stream with a years worth of Open edX data into ES for analysis.

About 30k or so should make it through the filter and into ES. I'm using the integer seconds value for start_position last March 1, 2018 (35102038). Every day in the log has at least ten or so events that should make it through the filter.

Everything works fine up until what seems like a random point, where Logstash suddenly jumps to a future date, skipping a month or two of data.

I've tried deleting the index, deleting .since_db and re-running the import, but each time the plugin somehow skips a large block of time somewhere (not the same place) between the start position and current time.

I log out the time of each event that made it through the filter, so on my last run I saw something like:

(...lots of log messages ... )

Found PYSJ SP Event
Event time: 2019-02-14T19:21:40.749596+00:00
 
Found PYSJ SP Event
Event time: 2019-02-14T19:21:41.095490+00:00
 
Found PYSJ SP Event
Event time: 2019-04-11T04:59:57.224356+00:00
 
Found PYSJ SP Event
Event time: 2019-04-11T01:14:25.953122+00:00

(...lots of log messages ..)

Had anyone else experienced this issue? Thanks for any thoughts...

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions