opencti performance improvement for exact searches #1048

gelim · 2021-10-12T15:46:38Z

Hello,

Please find a proposal for improving the exact search for observables by using the .read() API call with a proper set of filters.
A dictionary maps the main Cortex observable types to the corresponding OpenCTI keys.

For the moment it supports the following Cortex types: ip, url, domain, mail, hash and filename.
In addition, the helper function get_hash_type() has been added. It does a regex match on a Cortex observable of type hash to determine whether it is an MD5, a SHA1 or a SHA256, so that it can be translated to the proper OpenCTI observable type.
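
To illustrate the idea, here is a minimal sketch of such a helper. It is not the code from the commits: the return values ("md5", "sha1", "sha256") and the pattern table are assumptions made for this example, and only the digest lengths (32, 40 and 64 hex characters) are fixed facts.

```python
import re

# Hex-digest lengths: MD5 = 32, SHA-1 = 40, SHA-256 = 64 characters.
# The labels used as keys are illustrative, not the names used in the patch.
_HASH_PATTERNS = {
    "md5": re.compile(r"[a-fA-F0-9]{32}"),
    "sha1": re.compile(r"[a-fA-F0-9]{40}"),
    "sha256": re.compile(r"[a-fA-F0-9]{64}"),
}


def get_hash_type(value):
    """Return 'md5', 'sha1' or 'sha256', or None if the value matches none."""
    candidate = value.strip()
    for name, pattern in _HASH_PATTERNS.items():
        # fullmatch() requires the whole string to be hex of the exact length.
        if pattern.fullmatch(candidate):
            return name
    return None
```

A value that matches none of the three patterns returns None, which a caller could treat as a signal to fall back to the full-text search.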

For any observable type that is not present in the cortex2opencti_types dictionary, the (slow) full-text search is used instead.
Without the patch, querying a single observable takes ~10 seconds; with the patch it takes under one second.
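
The dispatch between the two paths could look roughly like the sketch below. The dictionary name cortex2opencti_types comes from this PR, but its contents here, the build_lookup() function and the shape of the filter entries are placeholders chosen for illustration; the real key names and filter format depend on the OpenCTI and pycti versions in use. The hash type is left out because its key depends on which digest type is detected.

```python
# Hypothetical mapping: Cortex data type -> OpenCTI filter key.
# The key names on the right are placeholders, not taken from the patch.
cortex2opencti_types = {
    "ip": "value",
    "url": "value",
    "domain": "value",
    "mail": "value",
    "filename": "name",
}


def build_lookup(data_type, value):
    """Describe how to query OpenCTI for one Cortex observable.

    Known types get an exact-match filter (fast path, meant for .read());
    unknown types fall back to a full-text search (slow path).
    """
    key = cortex2opencti_types.get(data_type)
    if key is None:
        # Type not in the dictionary: use the slow full-text search.
        return {"mode": "search", "search": value}
    # Assumed filter shape: one key with a list of exact values.
    return {"mode": "read", "filters": [{"key": key, "values": [value]}]}
```

For example, build_lookup("ip", "1.2.3.4") yields a "read" description with a single exact filter, while an unmapped type such as "registry" yields a "search" description.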

Without this, batch analyzer runs are prone to failure because the connection between the Cortex instance and OpenCTI easily times out.

Cheers,
-- Mathieu

ama-gelim added 2 commits October 12, 2021 16:47

Use read+filters method when doing exact search for perf boost

4245db6

Add dict to convert Cortex obs to OpenCTI + secure delete key via pop()

b19f174

nadouani added the category:upgrade label Jan 23, 2022
